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DETECTION OF MOLECULAR INTERACTIONS BY 
REPORTER SUBUNIT COMPLEMENTATION 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application claims priority to U.S. Provisional Patent Applications Serial No. 
60/042,576 filed April 2, 1997 and Serial No. 60/054,638, filed August 4, 1997; and U.S. 
Patent Application Attorney Docket No. 28600-20206.00, filed April 1, 1998. 

STATEMENT OF RIGHTS TO INVENTIONS MADE 
UNDER FEDERALLY SPONSORED RESEARCH 
Not applicable. 

TECHNICAL FIELD 
This invention is in the field of molecular biology and, more specifically, in the 
field of reporter systems useful for the analysis of protein-protein interactions. 

BACKGROUND 

The p-galactosidase enzyme (p-gal), the protein product of the £ coli lacZ gene, is 
widely used in studies of gene expression and cell lineage in higher organisms. Several 
biochemical assays of P-gal activity, including live-cell flow cytometry and histochemical 
staining with the chromogenic substrate 5-bromo-4-chloro-3-indolyl 
p-D-galactopyranoside (X-gal) make the product of the lacZ gene extremely versatile as a 
quantitative reporter enzyme, selectable marker, or histological indicator. Bronstein et al. 
(1989) J. Biolumin. Chemilumin. 4:99-1 1 1 ; Nolan et al. (1988) Proc. Natl. Acad. Set USA 
85:2603-2607; and Lojda (1979) Enzyme Histochemistry: A Laboratory Manual, Springer, 
Berlin. One property of the lacZ system that has been well-characterized in studies of 
bacterial genetics, but has not been exploited in eukaryotes is the phenomenon of 
intracistronic complementation. Studies in E. coli have shown that deletions of p-gal 
which remove portions of either the N-terminus or the C-terminus produce enzyme which 
is inactive. However, coexpression of one of these deletion mutants with a second inactive 
deletion mutant containing domains that are lacking in the first can restore P-gal enzymatic 
activity in a process called complementation. This complemented p-gal activity arises by 
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concentration-dependent assembly of a stable hetero-octameric enzyme complex 
comprising all the essential domains of the wild-type homotetramer. Ullman et al (1965) 
J. Mol Biol ]2:918-923; Ullman etal (1967) 'J. Mol Biol 24:339-343; and Ullman et 
al (1968)*/ Mol Biol 32:1-13. 
5 A system utilizing P-gal complementation in enzyme assays has been described. 

Henderson, U.S. Patent 4,708,929. In this system, enzymatically inactive p-gal 
polypeptide fragments, capable of combining with high affinity to form active P-gal by 
complementation, are used. One of the fragments is conjugated to analyte, which allows it 
to compete with analyte for binding to an analyte-binding protein. If bound to the analyte- 

10 binding protein, the p-gal fragment is unable to complement. Thus, by comparing p-gal 

activity in the presence of sample to that obtained in the presence of a known concentration 
of analyte (at equal concentrations of analyte-binding protein) the amount of analyte in the 
sample may be determined. This method requires high-affinity complementing subunits of 
p-gal, requires that an analyte-binding protein be known, and is not applicable to single- 

15 cell analysis. 

Previous systems for the study of protein-protein interactions have been described 
which utilize two fusion genes whose products reconstitute the function of a transcriptional 
activator. Fields et al, (1989) Nature 340:245-247; Bai et al, (1996) Metk Enzymol 
273:331-347; Luo et al, (1997) BioTechnigues 22(2):350-352. In one fusion gene, a 

20 sequence encoding a first protein is conjugated to a sequence encoding a DNA-binding 

domain of a transcriptional regulatory protein. In a second fusion gene, a sequence 
encoding a second protein is conjugated to a sequence encoding a transcriptional activation 
domain of a transcriptional regulatory protein. The two fusion genes are co-transfected 
into a cell which also contains a reporter gene whose expression is controlled by a DNA 

25 regulatory sequence that is bound by the DNA-binding domain encoded by the first fusion 

gene. Expression of the reporter gene requires that a transcriptional activation domain be 
brought adjacent to the DNA regulatory sequence. Binding of the first protein to the 
second protein will bring the transcriptional activation domain encoded by the second 
fusion gene into proximity with the DNA-binding domain encoded by the first fusion gene, 

30 thereby stimulating transcription of the reporter gene. Thus, the level of expression of the 

reporter gene will reflect the degree of binding between the first and second proteins. 
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There are several disadvantages associated with the use of the above-mentioned 
system. As it is dependent upon transcriptionally-regulated expression of a reporter gene, 
this system is limited to the assay of interactions that take place in the nucleus. In 
addition, the assay is indirect, relying on transcriptional activation of a reporter gene whose 
product is diffusible. Hence, a method which would allow a direct and immediate 
examination of molecular interactions, at the site where they occur, would be desirable. 

A system for detecting protein-protein interactions, not limited to nuclear 
interactions, has been described. U.S. Patent Nos. 5,503,977 and 5,585,245. In this 
system, fusions between potential interacting polypeptides and mutant subunits of the 
protein ubiquitin are formed. Juxtaposition of the two ubiquitin subunits brought about by 
interaction between potential interacting polypeptides creates a substrate for a ubiquitin- 
specific protease, and a small peptide reporter fragment is released. In this system, binding 
between the potential interacting polypeptides does not generate any type of enzymatic 
activity; therefore, signal amplification is not possible. Additionally, the ubiquitin system 
does not measure activity in intact cells, but relies on assays of proteolysis in cell-free 
extracts. What is needed is a sensitive method for examining protein interactions in intact 
cells in the relevant cellular compartment. 

Fluorescence imaging has been used to study the intracellular biochemistry of 
living cells. A fluorescent indicator for the adenosine 3\5'-cyclic monophosphate (cAMP) 
signaling pathway has been described in which the sensor is a cAMP kinase in which the 
catalytic and regulatory subunits each are labeled with a different fluorescent dye, such as 
fluorescein or rhodamine, capable of fluorescence resonance energy transfer in the 
holoenzyme complex. A change in shape of the fluorescence emission spectrum occurs 
upon cAMP binding, and therefore activation of the kinase can be visualized in cells 
microinjected with the labeled holoenzyme. Adams et al, Nature, 349: 694-697 (1991). 
This system is limited by the fact that it requires microinjection, and a preferred distance 
between the labeled units for energy transfer to occur. 

Substrates for P-lactamase have been described in the art which include a 
fluorescent donor moiety and a quencher, which include an attached group which makes 
them permeable through cell membranes, wherein the attached group is hydrolyzed off 
after the substrate enters the cell. Fluorescence energy transfer between the donor and 
quencher is monitored as an indicator of P-lactamase activity. This system also can be 
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used in a reporter gene assay using cells containing p-lactamase reporter genes functionally 
linked to a promoter. PCT WO 96/30540 published October 3, 1996, the disclosure of 
which is incorporated herein. 

DISCLOSURE OF THE INVENTION 

The present invention provides methods and compositions for detecting, assaying 
and quantitating molecular interactions within living cells and in vitro, through 
complementation between two or more low affinity reporter subunits, such as distinct E. 
coli lacZ mutations. In a preferred embodiment, protein-protein interactions within living 
cells are detected and quantitated using the methods and compositions of the present 
invention. The practice of the present invention enables, for the first time, the study of 
protein-protein interactions and their control in living mammalian cells without reliance 
upon the transcriptional activation of a reporter gene construct. Association of the proteins 
of interest results directly in enzyme activity and is independent of other cellular functions. 
Therefore, the present invention provides advantages over other systems currently in use 
by allowing the detection of complexes that are excluded from the nucleus, and detection 
of complexes whose formation would inhibit transcription. Furthermore, the present 
invention allows the detection and localization of specific binding interactions within cells 
at different stages of development and differentiation, and an analysis of the induction or 
inhibition of binding interactions in cells. 

Interactions occurring within the nucleus of the cell, interactions occurring in the 
cytoplasm, on the cell surface, within or on the surface of organelles, or between 
cytoplasmic and surface (either cellular or organellar) molecules, as well a interactions 
occurring outside the cell, are all capable of being detected in the practice of the present 
invention. Thus, the invention surmounts the limitations associated with previous assays 
for protein-protein interactions, which were either limited to interactions occurring in the 
nucleus, or did not always allow accurate localization of molecular interactions, and which 
were not well-suited for detection of interactions which resulted in inhibition of 
transcription or translation. 

Accordingly, in one embodiment, the invention provides a reporter system 
component comprising: 

a first low-affinity reporter subunit, coupled to a first putative binding moiety; 
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wherein the first low-affinity reporter subunit is capable of association with at least 
a second low-affinity reporter subunit to generate a detectable signal, said association 
being mediated by the first putative binding moiety. 

In another embodiment, the invention provides a method of determining the 
occurrence of binding between first and second putative binding moieties, the method 
comprising; 

a) providing a reporter system comprising: 

a first component comprising a first low affinity reporter subunit, 
coupled to the first putative binding moiety; and 

a second component comprising a second low affinity reporter 
subunit coupled to the second putative binding moiety; 

wherein the first low affinity reporter subunit is capable of association with 
at least the second low affinity reporter subunit to generate a detectable signal, said 
association being mediated by the binding of the first and second putative binding 
moieties; 

b) combining the first component and the second component; and 

c) detecting the presence or absence of the signal. 

In a further embodiment, the invention provides a method of screening for binding 
of a first binding moiety with members of a plurality of different second putative binding 
moieties, the method comprising: 

a) providing a plurality of reporter systems each comprising: 

a first component comprising a first low affinity reporter subunit coupled to 
the first binding moiety, and 

one of a plurality of second components each comprising a second low 
affinity reporter subunit coupled to one of said plurality of second putative binding 
moieties, wherein in each of said second components, said second putative binding 
moiety is different; 

wherein the first low affinity reporter subunit is capable of association with 
the second low affinity reporter subunit to generate a detectable signal upon the 
binding of the first binding moiety with one of said different second putative 
binding moieties; 
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b) individually combining the first component with each of the plurality of 
second components to produce a plurality of binding assay samples, each of which 
includes the first component and a different one of the second components; and 

c) detecting the presence or absence of the signal in each of the binding assay 
samples. 

The invention additionally provides nucleic acids encoding fusion proteins 
including a low affinity reporter subunit and a putative binding moiety, and the fusion 
proteins encoded by said nucleic acids. The invention further provides viral vectors 
comprising nucleic acids encoding such fusions proteins. The invention also provides cells 
transformed by the nucleic acids and viral vectors described above. 

All patents, patent applications and publications referred to herein are hereby 
incorporated by reference. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic illustration of three deletion mutant lacZ constructs, 
designated Act, Aco and A^i. 

Figure 2A is a schematic illustration of a viral construct encoding fusion proteins 
of the Act or Ao> p-gal mutants with either the intracellular FKBP-rapamycin associated 
protein (FRAP) or the intracellular rapamycin binding protein, FK506-binding protein- 12 
(FKBP12) upstream of the hygromycin or neomycin resistance genes. 

Figure 2B is a schematic illustration of a viral construct encoding fusion proteins 
of the Aa or Aa> P-gal mutants with either FRAP or FKBP12 and another protein, 
represented as x and x', upstream of the hygromycin or neomycin resistance genes. 

Figures 3A and 3B show X-gal staining of fixed cells expressing both FKBP12- 
Ag> and FRAP-Aa. Cells shown in 3b were exposed to 10 ng/ml rapamycin for 12 hr. 
Cells shown in 3a were not exposed to rapamycin. 

Figure 4A is a graph of P-gal activity vs. time with and without rapamycin 
treatment of C2C12 cells expressing both FKBP12-Aco and FRAP-Aa fusion proteins. 

Figure 4B is a graph of the dose-response to rapamycin of the activity of p-gal in 
C2C12 cells expressing both FKBP12-Aco and FRAP-Aa fusion proteins. 
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Figure 5 shows rapamycin-dependent increase in |3-gal activity in lysates from 
cells expressing both FKBP12-Aco and FRAP-Aa fusion proteins, measured by 
chemiluminescence. 

Figure 6A shows analysis by Fluorescence-Activated Cell Sorting (FACS) of 
5 C2C12 cells expressing both FKBP12-A© and FRAP-Aa after 90 minutes of rapamycin 

treatment Dark peaks represent profiles obtained from untreated samples; light peaks 
represent profiles from samples that have been treated with 10 ng/ml rapamycin. 

Figure 6B shows a FACS profile of untreated cells and indicates a subpopulation 
selected on the basis of low P-gal activity, 
10 Figure 6C shows FACS analysis of the subpopulation of cells selected in Figure 

6B after overnight culture in the absence (dark peak) or presence (light peak) of rapamycin. 
In Figures 6A, 6B and 6C, the vertical axis represents cell number and the horizontal axis 
represents intensity of p-gal fluorescence expressed on a logarithmic scale. 

Figure 7 shows EGF receptor dimerization monitored using P-gal 
15 complementation. 

Figure 7A depicts schematically the rationale of the assay: two weakly 
complementing deletion mutants of p-gal are linked to the extracellular and 
transmembrane domains of the EGF receptor. Receptor dimerization, stabilized by EGF, 
will drive p-gal complementation. 
20 Figure 7B shows the design of the retroviral constructs used in the assay. E. coli 

lacZ deletion mutants Act and A© were cloned into pWZL vectors expressing neomycin or 
hygromycin resistance, respectively. The extracellular and transmembrane (tm) domains 
of human EGF receptor were cloned in frame with the Aoc and Aco mutants. 

Figure 7C shows FACS analysis of a population of transduced and selected cells. 
25 EGF treatment increases the P-gal activity (fluorescein fluorescence) in a substantial 

proportion of the cells. The FACS profile of cells without EGF treatment is shaded in light 
gray and is outlined in white. The profile of cells treated with EGF is shaded dark gray. 

Figure 7D shows FACS analysis of chimeric receptor expression, using a 
monoclonal antibody to the extracellular domain of the human EGF receptor. The FACS 
30 profile of the transduced and selected population is shaded medium gray and outlined in 

white; untransduced cells are shaded light gray and outlined in white. The FACS was used 
to clone cells that had low P-gal activity in the absence of EGF and showed increased p-gal 
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activity in the presence of EGF. One clone that had low levels of the chimeric receptor 
relative to the population (shaded in dark gray) was used for further analyses. 

Figure 7E shows induction of EGF receptor dimerization (P-gal activity) in all of 
the cells of the clone selected in Figure 7D, upon treatment with 100 ng/ml EGF for two 
5 hours. Untreated cells are shaded in light gray and outlined in white; EGF treated cells are 

shaded in dark gray. 

Figure 7F shows that dimerization can be detected after very short treatments with 
EGF. Cells were treated with 100 ng/ml EGF for 0, 1, 4, 8, and 15 minutes before cells 
were rinsed and processed for FACS analysis. The mean fluorescence of the cell sample is 
10 plotted. 

Figure 8 shows a time-course of EGF receptor dimerization and receptor 
expression on the cell surface, following treatment with EGF. Cells expressing chimeric 
receptors were treated with 100 ng/ml EGF for 0 to 24 hours. Dimerization, as measured 
by p-gal activity, was monitored by FACS, and the mean P-gal activity (fluorescein 

15 fluorescence) of the cells was plotted (left-hand axis; — ■ — ). Chimeric receptor levels on 

the cell surface were measured on the FACS using a monoclonal antibody to the 
extracellular domain of the human EGF receptor and a phycoerythrin-labeled second 
antibody. Mean phycoerythrin fluorescence values are shown on the right-hand axis (-A- 
-). Triplicate samples were analyzed for each time point, and 5000 cells were analyzed for 

20 each sample. The error bars indicate the standard deviation of the replicate samples. 

Figure 9 shows that EGF receptor dimerization is enhanced by tyrphostin AG 1478. 
Figure 9A shows, in the left panel, schematic diagrams of different regimens for 
treatment of cells with EGF, tyrphostin, or both. After the various treatments, cells were 
analyzed on the FACS, and the mean fluorescence is shown in the right panel. Each 

25 treatment was performed in triplicate. 

Figure 9B shows measurements of P-galactosidase activity in EGF-treated cells 
compared with EGF+tyrphostin-treated cells. Cells expressing the chimeric receptor were 
treated with either 100 ng/ml EGF ( ■ ) or EGF and 100 nM tyrphostin AG 1478 for 0 
to 24 hours (-- A--). Triplicate samples were analyzed for each time point, and the error 

30 bars indicate the standard deviation of the replicate samples. 
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MODES FOR CARRYING OUT THE INVENTION 

Definitions 

As used herein, the following terms have the following definitions: 
As used herein, a "reporter subunit" refers to a member of a complex of two or 
5 more subunits which are capable of associating with low binding affinity with each other 

to generate a detectable signal, or which are capable of associating with each other and one 
or more additional substances to generate a detectable signal, and which do not 
individually generate the detectable signal. 

As used herein, "low affinity" reporter subunits refer to molecular species which 

10 have a sufficiently low binding affinity for each other such that when they each are 

covalently attached to two different binding moieties, they substantially do not become 
associated unless a binding interaction between the two binding moieties occurs. "Low 
affinity" thus generally refers to a binding affinity which is at least less than that of the 
attached binding moieties. 

1 5 As used herein, "binding moieties" refers to at least two molecular species, such as 

proteins or fragments thereof, which interact with each other to form a stable complex. 

As used herein, a "detectable signal" refers to any detectable signal which occurs 
upon the association of the reporter subunits or via the interaction of the associated 
subunits with another substance. The detectable signal may be for example, a 

20 chromogenic, fluorescent, phosphorescent or chemiluminescent signal, such as a detectable 

product of an enzymatic reaction catalyzed by the associated reporter subunits. 

The terms "protein", "polypeptide", and "peptide" are used interchangeably herein 
to refer to polymers of amino acids of any length. The polymer may be linear or branched, 
it may comprise modified amino acids, and it may be interrupted by non-amino acids. It 

25 also may be modified naturally or by intervention; for example, disulfide bond formation, 

glycosylation, myristylation, acetylation, alkylation, phosphorylation or 
dephosphorylation. Also included within the definition are polypeptides containing one or 
more analogs of an amino acid (including, for example, unnatural amino acids) as well as 
other modifications known in the art. 

30 Unless otherwise indicated, the practice of the present invention will employ 

conventional techniques of molecular biology, biochemistry, microbiology, recombinant 
DNA, nucleic acid hybridization, genetics, immunology, embryology and oncology which 
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are within the skill of the art. Such techniques are explained fully in the literature. See, for 
example, Maniatis, Fritsch & Sambrook, MOLECULAR CLONING: A LABORATORY 
MANUAL, Cold Spring Harbor Laboratory Press (1982); Sambrook, Fritsch & Maniatis, 
MOLECULAR CLONING: A LABORATORY MANUAL, Second Edition, Cold Spring 
Harbor Laboratory Press (1989); Ausubel, et a/., CURRENT PROTOCOLS IN 
MOLECULAR BIOLOGY, John Wiley & Sons (1987, 1988, 1989, 1990, 1991, 1992, 
1993,1994, 1995, 1996). 
Reporter Subunits 

As used herein, a "reporter subunit" refers to a member of a complex of two or 
more subunits which are capable of associating with low binding affinity with each other 
to generate a detectable signal, or which are capable of associating with each other and one 
or more additional substances to generate a detectable signal, and which do not 
individually generate the detectable signal. 

The detectable signal thus provides an indication that the subunits have become 
associated. In general, in an assay of the binding affinity of a first and at least a second 
molecular species (the "putative binding moiety"), a first component is provided which 
includes one reporter subunit attached to the first molecular species, and a second 
component is provided which includes another of the same or different reporter subunit 
attached to the second molecular species. The reporter subunits preferably have 
sufficiently low binding affinity for each other such that they substantially do not associate 
with each other in solution unless and until the molecules for which binding affinity is 
being assayed have sufficient binding affinity to mediate complex formation between the 
two components. Upon binding of the binding moieties and resulting association of the 
reporter subunits, generally by non-covalent interactions, such as hydrogen bonding or 
hydrophobic interactions, for example, the reporter subunits are oriented close enough to 
each other such that they are capable of associating with low affinity and generating a 
detectable signal. In the system, individual reporter subunits are not able to generate the 
detectable signal. Thus, the reporter subunits undergo forced complementation when 
brought into close proximity. 

The reporter subunits can be designed to have a preferred low affinity for a 
particular application and for the conditions in which the binding assay is done. Binding 
of molecules will depend upon factors in solution such as pH, ionic strength, concentration 
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of components of the assay, and temperature. In the binding assays using reporter systems 
described herein, the binding affinity of the reporter subunits should be low enough to 
permit forced complementation. Non-limiting examples of dissociation constants of the 
reporter subunits in an assay solution, such as a buffered system or cell interior, are on the 
order of greater than about 10" M for example, greater than 10" 6 M or optionally, between 
about 10' 2 to 10' 5 M depending upon the properties of the particular assay system. 

Reporter subunits which have sufficiendy low binding affinity, and yet are still 
capable of associating and generating a detectable signal upon the binding of molecular 
species attached to them can be designed as disclosed herein. Reporter subunits which can 
be used include any low binding affinity subunits which are capable of associating to 
produce a detectable signal. In one preferred embodiment, the reporter subunits are 
proteins which are capable of associating and are capable when associated of catalyzing a 
reaction which produces a directly or indirectly detectable product. 

Protein enzymes capable of catalyzing conversion of a substrate to a detectable 
reaction product, either directly or indirectly, which have been used, for example, in cell 
based screening assays may be used as reporter subunits. The enzymes can be modified 
into reporter subunits and to have a low binding affinity and the ability to undergo forced 
complementation. These may be modified, for example, by site directed or random 
mutagenesis, or deletion mutation, to provide low affinity subunits which are capable of 
associating with low binding affinity and thereby undergo complementation to catalyze an 
enzymatic reaction. For example, reporter subunits capable of complementation with low 
binding affinity may be derived from enzymes such as P-galactosidase, P-glucuronidase 
(GUS), p-lactamase, alkaline phosphatase, peroxidase, chloramphenicol acetyltransferase 
(CAT) and luciferase. Any of a range of enzymes capable of producing a detectable 
product eitheT directly or indirectly may be so modified or may occur naturally. 
Additionally, reporter subunits may be derived from non-enzymatic molecules. For 
example, association of two proteins may generate a unique conformation in one or both of 
the interacting proteins that can be recognized by an antibody or other ligand. 

P-galactosidase, which is encoded by the £. coli lacZ gene, is an enzyme which has 
been developed in the art as reporter enzyme. P-galactosidase activity may be measured by 
a range of methods including live-cell flow cytometry and histochemical staining with the 
chromogenic substrate 5-bromo-4-chloro-3-indolyl p-D-galactopyranoside (X-Gal). Nolan 
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et al, Proc. Natl Acad Set, USA, 85:2603-2607 (1988); and Lojda, Z., Enzyme 
Histochemistry: A Laboratory Manual, Springer, Berlin, (1979), the disclosures of which 
are incorporated herein. 

Enzyme mutants capable of intracistronic complementation are especially suitable 
5 as reporter subunits. In E. coli, deletions of either the N or C terminus of p-gal produce 

enzyme that is inactive yet can be complemented by coexpression with a second inactive 
deletion mutant containing domains lacking in the first. The N- and C- terminal domains 
involved in complementation are known as the a and co regions. Ullmann et al, J. Mol 
Biol, _12:918-923 (1965); Ullman et al,J. Mol Biol, 24:339-343 (1967); and Ullman et 

10 al, J. Mol Biol t 32:1-13 (1968), the disclosures of which are incorporated herein. P-Gal 

complementation systems in mammalian cells are described in Mohler and Blau, Proc. 
Natl Acad Sci. USA, 93:12423-12427 (1996), the disclosure of which is incorporated 
herein. As described therein, vectors expressing complementing mutants of P-gal may be 
constructed. A naturally occurring lacZ mutation, AM 15 (Beckwith, J. Mol Biol, 8:427- 

15 430 (1964); and Prentki, Gene, 122:231-232 (1992) and Nature, 369:761-766 (1994), the 

disclosures of which are incorporated herein) designated as Act herein may be constructed. 
Another deletion mutation, designated Aco herein, was made as disclosed herein, and its 
structure is shown schematically in Figure 1 . The peptide region between the a and co 
regions is referred to herein as the \i region, as first defined by Mohler and Blau, Proc. 

20 Natl Acad. Sci. USA, 93:12423-12427 (1996). The Aa and Aco mutants are demonstrated 

herein to have optimal forced complementation properties. These deletion mutants express 
polypeptides representing an a-acceptor/co-donor (Aa) and an a-donor/oo-acceptor (Aco). 

p-Gal complementation is based on the ability of mutant enzyme molecules to 
associate and reconstitute an active enzyme. Accordingly, two p-gal molecules that each 

25 lack one or more structural domains critical to the activity of the holoenzyme, associate to 

form a single functional unit that contains all of the required structural determinants. This 
phenomenon is dependent on the fact that interactions that would normally take place 
between domains of the single peptide of wild type p-gal, can also exist between domains 
present on two distinct peptides, leading to the formation of a stable dimer. This dimer 

30 behaves functionally as a single peptide of wild type p-gal, and participates ultimately in 

the formation of the tetramer that represents the active form of the enzyme. Thus, the 
ability of a pair of P-gal mutants to recreate an active form of the enzyme is strongly 
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dependent on their ability to form a stable dimer and therefore would be expected to be 
dependent on their affinity for each other. 

Surprisingly, it has been discovered that forced association or complementation of 
two distinct low affinity (5-gal mutants results in an efficient formation of active enzyme 
molecules in mammalian cells even though they have relatively low affinity for each other. 
The forced complementation results when the two mutant subunits are brought into 
association due to the binding affinity of the binding moieties attached to the mutant 
subunits. By engineering constructs in which domains or proteins of interest drive the 
dimerization between Aa and 

Aco P-gal mutants, it is possible to monitor and quantitate such interactions by assessing 
the efficiency of complementation obtained by coexpression of these fusion proteins in 
intact eucaryotic cells. 

In addition to two-component complementation between Aa and Aco P-gal mutants, 
the invention also contemplates three-component complementation among mutants each of 
which contains only a single functional a, or co region. Among other applications, this 
might allow detection of interactions among three distinct proteins based on a single 
reporter. Similarly, higher-order systems containing four or more reporter components are 
within the scope of the invention. 

Using the fused protein systems, protein-protein interactions and their regulation 
can be studied in mammalian cells without relying on the transcriptional activation of a 
reporter construct. Association of the proteins of interest directly results in enzyme 
activity and is independent from other cellular functions. Therefore this system allows the 
detection of complexes that are excluded from the nucleus, or that involve partners that 
inhibit transcription. Furthermore it allows the detection, quantitation and determination 
of the localization of specific binding interactions within cells, as well as the temporal 
distribution of such binding interactions. Binding interactions may be compared in cells at 
different stages of development or differentiation, as well as in normal vs. pathologic cells 
and in infected vs. uninfected cells, to give but a few examples. Binding interactions can 
therefore be assessed against a background of endogenous competing components that may 
differ in nature and in concentration among different cell types. 

Other enzymes may be identified or constructed which are capable of forced 
complementation in the reporter systems described herein. For example, the phenomenon 
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of intracistronic complementation of enzymatic activity has been described for tryptophan 
synthetase. Jackson et al J, Biol. Chem., 244:4539-4546 (1969). Complementation 
between mutant subunits of thymidylate synthase has been described. Pookanjanatavip et 
al t Biochemistry 31:10303-10309 (1992), the disclosure of which is incorporated herein. 
Thus, reporter subunits derived from any complementing enzyme system known in the art 
can be used in the practice of the present invention. Mutants can be derived from other 
enzymes or proteins that are capable of serving as reporters of protein-protein interactions, 
or whose activity can be regulated as described above. The system exploits the 
complementation ability of low binding affinity enzyme mutants for detection of protein- 
protein interactions. 

For example, complementing low affinity reporter subunits derived from 0- 
lactamase can be constructed. Activity of the complementing P-lactamase can be detected 
using substrates for P-lactamase developed in the art which include a fluorescent donor 
moiety and a quencher, which include an attached group which makes them permeable 
through cell membranes, wherein the attached group is hydrolyzed off after the substrate 
enters the cell. Fluorescence energy transfer between the donor and quencher then can be 
monitored as an indicator of P-lactamase activity, as described in PCT WO 96/30540 
published October 3, 1 996. 

In addition to enzymes which catalyze a reaction to produce a detectable product, 
proteins, protein domains or protein fragments which are themselves detectable upon 
association can be used. Exemplary proteins include green fluorescent proteins, which 
have characteristic detectable emission spectra, and have been modified to alter their 
emission spectra, as described in PCT WO 96/23810, the disclosure of which is 
incorporated herein. Fusions of green fluorescent proteins with other proteins, and DNA 
sequences encoding the fusion proteins which are expressed in cells are described in PCT 
WO 95/07463, the disclosure of which is incorporated herein. 

Other exemplary subunits include subunits which are capable of associating to 
produce a photochemical signal such as a fluorescent or luminescent signal, including 
chemiluminescent or photoluminescent signals. The reporter subunits also may comprise 
fluorophores which are capable of detectable resonance energy transfer when they are 
closely associated, as disclosed, for example, in EP Publication No. 0 601 889 A2 and 
PCT WO 96/41 166, the disclosures of which are incorporated herein. 
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Other complementing enzymes are known in the art, for example, pancreatic 
ribonuclease and Staphylococcal nuclease. Mutants of the complementing subunits of 
these enzyihes can be constructed, by methods well-known to those of skill in the art such 
as site-directed mutagenesis, to generate low-affinity complementing subunits. One 
possible use for these types of complementing protein is as a tumor therapeutic, wherein a 
tumor-specific protein serves as a bridge to bring together two proteins, each of which is 
fused to a low-affinity complementing fragment of the nuclease. The resultant nuclease 
activity might, in some cases, kill the cell by destroying mRNA, genomic DNA, etc. 

Binding Moieties 

Binding moieties which can be assayed for their binding affinity with each other 
include any molecules capable of a binding interaction. The binding interaction between 
the two or more binding moieties may be either direct or in the form of a complex with one 
or more additional binding species, such as charged ions or molecules, ligands or 
macromolecules. 

The binding moieties which are attached to the reporter subunit can be any of a 
range of different molecules including carbohydrates, lipids, proteins, and nucleic acids, as 
well as portions, polymers and analogues thereof, provided they are capable of being 
linked to the reporter subunit. Exemplary proteins include members of a signal 
transduction cascade, proteins regulating apoptosis, proteins that regulate progression of 
the cell-cycle or development of tumors, transcriptional regulatory proteins, translational 
regulatory proteins, proteins that affect cell interactions, cell adhesion molecules (CAMs), 
ligand-receptor pairs, proteins that participate in the folding of other proteins, and proteins 
involved in targeting to particular intracellular compartments, such as the Golgi apparatus, 
endoplasmic reticulum, ribosomes, chloroplasts and mitochondria. 

Other exemplary proteins include protein hormones and cytokines. Cytokines 
include those involved in signal transduction, such as interferons, chemokines, and 
hematopoietic growth factors. Other exemplary proteins include interleukins, 
lymphotoxin, transforming growth factors-a and p, and macrophage and granulocyte 
colony stimulating factors. Other proteins include intracellular enzymes such as protein 
kinases, phosphatases and synthases. 

Exemplary proteins involved in apoptosis include tumor necrosis factor (TNF), Fas 
ligand, interleukin-ip converting enzyme (ICE) proteases, and TNF-related apoptosis- 
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inducing ligand (TRAIL). Proteins involved in the cell cycle include deoxyribonucleic 
acid (DNA) polymerases, proliferating cell nuclear antigen, telomerase, cyclins, cyclin 
dependent kinases, tumor suppressors and phosphatases. Proteins involved in transcription 
and translation include ribonucleic acid (RNA) polymerases, transcription factors, 
5 enhancer-binding proteins and ribosomal proteins. Proteins involved in cellular 

interactions such as cell-to-cell signaling include receptor proteins, and peptide hormones 
or their enhancing or inhibitory mimics. 

Binding of molecules will depend upon factors in solution such as pH, ionic 
strength, concentration of components of the assay, and temperature. In the binding assays 

10 using reporter systems described herein, the binding affinity of the binding moieties should 

be high enough to permit forced complementation between the reporter subunits. Non- 
limiting examples of dissociation constants of the binding moieties in an assay solution, 
such as a buffered system or cell interior, are on the order of less than about 10" 8 M, for 
example, less than about 

15 10" 9 M, or optionally, between about 10" 9 to 10' I2 M, depending upon the properties of the 

particular assay system. 

Linking of the Reporter Subunit and the Binding Moiety 

The reporter subunit and one or more binding moieties are generally linked either 
directly or via a linker, and are generally linked by a covalent linkage. For example, when 

20 the reporter subunit and the binding moiety are proteins, they may be linked by methods 

known in the art for linking peptides. 

In one preferred embodiment, the reporter subunit and the binding moiety comprise 
a fusion protein including the reporter subunit which is a low binding affinity enzyme 
complement and the binding moiety being assayed. The fusion protein can thus be 

25 expressed from an encoding nucleic acid intracellularly. This system is advantageous 

since it permits the detection and quantitation of protein-protein interactions in cells, such 
as mammalian cells, based on enzymatic complementation of the low affinity reporter 
subunits. 

For example, in the embodiment wherein chimeric fused proteins are produced 
30 intracellularly that include one of two complementing low affinity p-gal mutants and a 

"test" protein of interest, the detected P-gal activity due to interactions between two 
chimeric proteins of interest will be proportional to the strength of the interaction of the 
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non-p-gal protein components. Thus, the interaction is driven by the test proteins of 
interest, not the complementing mutants. The enzymatic activity serves as an indicator of 
that interaction. Another advantage of this system is that only low levels of expression of 
the test proteins are required to detect binding. 

The fusion gene constructs preferably are constructed and transformed into cells to 
produce low level expression. The system then permits the monitoring of interactions in a 
given cell in the presence of endogenous competing protein partners, where the fusion 
protein will function as a "tracer" for the binding/association reaction. Such a system will 
not be prone to artifacts arising from overexpression of introduced proteins. Reduction in 
expression of fusion gene constructs can be accomplished by choice of appropriate 
promoters, ribosome binding sites and other regulatory elements. For example, fusion 
gene constructs can be introduced into vectors in which they lie upstream of an antibiotic 
resistance gene whose translation is regulated by the Encephalomyocarditis virus internal 
ribosome entry sequence (IRES), and which contain a mutation in the splice 
donor/acceptor sequences upstream of the ATG sequence responsible for translational 
initiation of the fusion gene. This type of construct results in a lower translation efficiency 
of the first coding sequence in a bicistronic message, but does not affect translation of the 
second (antibiotic resistance) sequence, which is solely dependent on the IRES. As a 
result of these reduced levels of expression, the frequency of spontaneous interaction of 
reporter subunits, which is concentration-dependent, will be significantly reduced. 

Expression of Fusion Proteins 

The invention provides fusion proteins between a putative binding moiety and a 
low affinity reporter subunit. The putative binding moiety may comprise any protein or 
other molecule whose ability to bind to a second molecule is to be tested. The low affinity 
reporter subunit may be any molecule wherein the monomer subunit is inactive, but 
association of two or more identical or different monomers restores activity. The activity 
must be capable of generating a detectable signal. In a preferred embodiment, the low 
affinity reporter subunits comprise mutants of P-galactosidase capable of complementation 
with one another. 

Fusion proteins comprise a single continuous linear polymer of amino acids which 
comprise the full or partial sequence of two or more distinct proteins. The construction of 
fusion proteins is well-known in the art. Two or more amino acids sequences may be 
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joined chemically, for instance, through the intermediacy of a crosslinking agent. In a 
preferred embodiment, a fusion protein is generated by expression of a fusion gene 
construct in a cell. A fusion gene construct comprises a single continuous linear polymer 
of nucleotides which encodes the full or partial sequences of two or more distinct proteins 
in the same uninterrupted reading frame. Fusion gene constructs generally also contain 
replication origins active in eucaryotic and/or procaryotic cells and one or more selectable 
markers encoding, for example, drug resistance. They may also contain viral packaging 
signals as well as transcriptional and/or translational regulatory sequences and RNA 
processing signals. 

The fusion gene constructs of the invention are introduced into cells to assay for 
binding between the putative binding moieties encoded by the fusion gene constructs. The 
fusion gene constructs may also contain promoters and other transcriptional and/or 
translational regulatory sequences that are normally associated with the gene encoding the 
putative binding moiety. The fusion gene constructs may be introduced into cells by any 
method of nucleic acid transfer known in the art, including, but not limited to, viral 
vectors, transformation, co-precipitation, electroporation, neutral or cationic liposome- 
mediated transfer, microinjection or gene gun. Viral vectors include retroviruses, 
poxviruses, herpesviruses, adenoviruses, and adeno-associated viruses. Particularly 
preferred in the present invention are retroviral vectors, which are capable of stable 
integration into the genome of the host cell. For example, retroviral constructs encoding 
integration and packaging signals, drug resistance markers and one or more fusion genes of 
interest are useful in the practice of the invention. 

Different fusion gene constructs encoding unique fusion proteins may be present on 
separate nucleic acid molecules or on the same nucleic acid molecule. Inclusion of 
different fusion gene constructs on the same nucleic acid molecule is advantageous, in that 
uptake of only a single species of nucleic acid by a cell is sufficient to introduce sequences 
encoding both putative binding partners into the cell. By contrast, when different fusion 
constructs are present on different nucleic acid molecules, both nucleic acid molecules 
must be taken up by a particular cell for the assay to be functional. Thus, problems of cell 
mosaicism are avoided when both fusion gene constructs are included on the same nucleic 
acid molecule. 
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The fusion gene constructs or fusion proteins of the invention may be introduced 
into cultured cells, animal cells in vivo, animal cells ex vivo, or any other type of cell in 
which it is desired to study protein-protein interactions. 

Assays 

The reporter systems disclosed herein may be used to assay binding interactions of 
putative binding moieties attached to low affinity reporter subunits through 
complementation between the low affinity reporter subunits which produces a detectable 
signal. In addition to testing for direct binding interactions between the putative binding 
moieties, interactions dependent upon one or more additional molecules or ions may be 
evaluated. Further, multi-molecular interactions in living animal cells can be evaluated, as 
well as the influence of various drugs, peptides and pharmaceuticals on these interactions. 

In one embodiment, the binding affinity of one or more putative binding moieties 
may be measured by providing a reporter system including one component having one of 
the moieties bound to a low affinity reporter subunit and at least one other component 
including one other putative binding moiety bound to a second low affinity reporter 
subunit. The binding moieties may be different or the same. In the system, the reporter 
subunits are capable of binding and generating a detectable signal only if they are brought 
into proximity by the binding of the one or more putative binding moieties. The signal can 
be directly or indirectly detected and quantitated. 

In one embodiment of the invention, protein-protein interactions can be detected 
and quantitated. The signal produced by the complementing reporter subunits can serve as 
an indicator of binding between the putative binding moieties, either directly or indirectly 
via a third substance. Signals which could be detected include light emission and 
absorbance. Exemplary signals include chromogenic, fluorescent and luminescent signals. 
These signals can be detected and quantitated visually or through the use of 
spectrophotometers, fluorimeters, microscopes, scintillation counters or other 
instrumentation known in the art. 

Binding of components of the reporter systems disclosed herein will depend upon 
factors in solution, such as pH, ionic strength, concentration of components of the assay, 
and temperature. Assay solutions can be designed and developed for a particular system. 
The reporter systems disclosed herein can be used to conduct assays in solutions, such as 
buffered cell free solutions, cell interiors, solutions of cells, solutions of cell lysates, and 
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solutions of cell fractions, such as nuclear fractions, cytoplasmic fractions, mitochondrial 
fractions, and membrane fractions. Methods for preparing assay solutions, such as enzyme 
assay solutions, cell extracts, and cell suspensions, known in the art may be used. For 
example, physiologically compatible buffers such as phosphate buffered saline may be 
5 used. See for example, the series, Methods in Enzymology, Academic Press, New York. 

In one embodiment, the low affinity reporter subunits are capable of 
complementing one another to form an enzymatically active complex that is capable of 
catalyzing the conversion of a substrate to a product which is detectable, either directly or 
indirectly. In one embodiment, the reporter system can include two or more components, 

1 0 each of which is a fusion protein, wherein the fusion proteins each comprise a putative 

binding protein fused to a low affinity reporter subunit. Thus, nucleic acids encoding the 
fusion proteins can be constructed, introduced into cells and expressed in cells. 
Alternatively, the bound reporter units or bound binding moieties can be detecting by 
detecting the binding of a labeled specific binding moiety such as an antibody to the bound 

15 complex. 

In one embodiment, the low affinity reporter subunits may be complementing 
subunits of p-gal. The system may include three or more reporter subunits all of which are 
required to associate in order to produce the detectable signal. Methods for detecting the 
reaction products of active p-gal that have been developed in the art may be used. For 

20 example, p-galactosidase activity may be measured by a range of methods including live- 

cell flow cytometry and histochemical staining with the chromogenic substrate 5-bromo-4- 
chloro-3-indolyl P-D-galactopyranoside (X-Gal). Nolan etal r Proc. Natl Acad Sci, USA, 
85:2603-2607 (1988); and Lojda, Z., Enzyme Histochemistry: A Laboratory Manual, 
Springer, Berlin, (1979), the disclosures of which are incorporated herein. Histochemical 

25 staining for p-gal can be achieved by fixation of cells followed by exposure to X-gal. 

Assays for p-gal activity described in Mohler and Blau, Proc. Natl Acad Set , 
93:12423-12427 (1996), the disclosure of which is hereby incorporated by reference, may 
be used. In one embodiment, intracellular analyses may be conducted by fixing cells and 
staining with the indigogenic substrate X-gal. Fixed cells also can be analyzed by assaying 

30 for p-gal activity by fluorescence histochemistry using an azo dye in combination with 

either X-gal or 5-bromo-6-chloro-3-indolyl P-D-galactopyranoside (5-6-X-Gal). A 
preferred combination is the azo dye red violet LB (Sigma Chemical, St. Louis, MO) and 
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5-6-X-Gal, referred to as Fluor-X-gal. For this combination, fluorescence micrographs can 
be obtained on a fluorescence microscope using a rhodamine/Texas Red filter set. Use of 
these substrates allows, for the first time, P-gal-dependent fluorescence to be visualized 
simultaneously with two or more other fluorescent signals. 

Vital substrates for P-gal, which can be used in living cells, are also encompassed 
by the invention. For example, a vital fluorogenic substrate, resorufin P-galactoside 
bis-aminopropyl polyethylene glycol 1900 (RGPEG) has been described. Minden (1996) 
BioTechniques 20(1) : 122- 129. This compound can be delivered to cells by microinjection, 
electroporation or a variety of bulk-loading techniques. Once inside a cell, the substrate is 
unable to escape through the plasma membrane or by gap junctions. Another vital 
substrate that can be used in the practice of the invention is fluorescein di-P-D- 
galactopyranoside (FDG), which is especially well-suited for analysis by fluorescence- 
activated cell sorting (FACS) and flow cytometry. Nolan et al (1988) Proc. Natl Acad, 
Set USA 85:2603-2607 and Rotman et al (1963) Proc. Natl Acad Sci. USA 50:1-6. 

P-gal may also be detected using a chemiluminescence assay. For example, cells 
containing P-gal fusions are lysed in a mixture of buffers containing Galacton Plus 
substrate from a Galactolight Plus assay kit (Tropix, Bedford MA). Bronstein et al 9 J. 
Biolumin. Chemilumin., 4:99-1 1 1 (1989) the disclosure of which is incorporated herein. 
After addition of Light Emission Accelerator solution, luminescence is measured in a 
luminometer or a scintillation counter. 

Reporter systems other than p-gal may also be used in the practice of the invention. 
For example, the enzyme P-glucuronidase (GUS) can be used as a reporter and 
chromogenic and fluorogenic GUS substrates have been developed. The GUS substrate 5- 
bromo-4-chloro-3-indolyl p-D-glucuronic acid (X-gluc) can be used in both chromogenic 
and fluorogenic applications, as follows. In one method of chromogenic staining, fixed 
cells are washed in PBS and stained with 2 raM X-gluc (Molecular Probes, Eugene OR), 
10 mM EDTA, 0.5 mM K 3 Fe(CN) 6 , 0.5 mM K 4 Fe(CN) 6 , 0.1% Triton X-100, 0.1 M 
NaP0 4 . Fluorogenic staining may be achieved by using a combination of 5-bromo-6- 
chloro-3-indolyl P-D-glucuronic acid (5, 6 X-gluc, Molecular Probes, Eugene, OR) and 
Fast Red Violet LB (Sigma Chemical, St. Louis, MO). Fixed ceils are rinsed with PBS 
and stained in 50 jig/ml 5, 6 X-gluc and 100 ^ig/ml Fast Red Violet LB, then rinsed in PBS. 
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Fluorescence is detected on a fluorescence microscope adjusted for detection of rhodamine 
fluorescence. 

Intone embodiment of the invention, the reporter subunits comprise an enzyme and 
an inhibitor df the enzyme. Preferably, the inhibitor has a low affinity for the enzyme. In 
this case, association between the putative binding moieties is evidenced by inhibition of 
the activity of the enzyme. Exemplary enzymes include p-gal, GUS, P-lactamase, etc. 

While dimeric reporter subunit complexes are discussed herein by way of example, 
multimeric reporter subunits also can be used, as can reporter subunits which are only 
active in the presence of one or more additional molecules or atoms. An example of a 
trimeric reporter subunit system would be one consisting of a P-gal co donor (such as a 
Aa-Aji double mutant), a p-gal \i donor (such as a Act-A© double mutant) and a p-gal a 
donor (such as a Aji-Acd double mutant), wherein each individual mutant, and any pairwise 
combination of two mutants, is enzymatically inactive. Activity would be obtained only if 
all three subunits were able to associate with one another. Enzyme reaction products can. 
be detected using methods available in the art, such as biochemical assay, microscopy, 
flow cytometry, light emission or absorption detection, and immunological methods. 

The methods disclosed herein enable the detection and quantitation of binding 
events in cell lysates, as well as in intact cells. Thus, interactions between fully folded 
proteins are detectable, and co-translational expression of the binding moieties is not 
necessary for binding to be detected. 

In the practice of the invention, the reaction product may be detected indirectly, for 
example, through immunological techniques, such as immunofluorescent labeling. 

Protein-protein interactions can be measured in a reporter system which includes 
one or mpre fusion proteins. The fusion proteins each include a putative binding protein 
coupled tb a low affinity reporter subunit. For intracellular expression of the fusion 
proteins, one. or more fusion gene constructs are prepared which include sequences 
encoding the fusion protein(s). The fusion gene constructs may be introduced into cells by 
methods available in the art, including, but not limited to, viral vectors, transformation, co- 
precipitation, electroporation, neutral or cationic liposome-mediated transfer, 
microinjection or gene gun. 

A variety of cell-based assays can be conducted using the cells containing the 
fusion gene constructs. Binding of the putative binding moieties on the fusion proteins 
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expressed in the cells can be confirmed by detecting the signal produced by the reporter 
subunits undergoing forced complementation. Thus, for example, when the reporter 
subunits are complementing P-gal subunits, cells exhibiting P-gal activity indicate binding 
between the putative binding moieties within those cells. 

5 The fusion gene constructs may also contain promoters and other transcriptional 

and/or translational regulatory sequences that are normally associated with the gene 
encoding the putative binding moiety. This permits the study of physiologically-relevant 
levels of the putative binding proteins in vivo, in contrast to systems in which test proteins 
are overexpressed. Further, this permits the study of naturally-occurring changes in levels 

10 of binding activity over time and can reveal the effects of endogenous or exogenous 

substances on binding interactions. 

The methods and compositions of the invention can also be used to study other 
molecules which influence the interaction of two putative binding partners. Proteins, 
peptides, nucleic acids, carbohydrates, lipids, ions, small molecules, synthetic compounds 

15 or other substances (either endogenous to the cell or exogenously added) may act as either 

agonists or antagonists of a binding interaction. By measuring the effect of such molecules 
on, for example, p-gal activity produced by cells containing two or more fusions 
representing a particular pair of test proteins, agonist or antagonist activity of such 
molecules can be determined. Use of the methods and compositions of the invention will 

20 allow high-throughput assays to be carried out to test for agonists or antagonists of a 

particular binding interaction. Such high-throughput assays will be especially valuable in 
screening for drugs that influence medically-relevant protein-protein interactions. 

Putative binding partners, or putative binding moieties, as used in the invention, 
can include molecules which do not normally interact with each other, but which each 

25 interact with a third molecule so that, in the presence of the third molecule, the putative 

binding partners are brought together. Thus, substances which influence an interaction 
between putative binding partners include those which stimulate a weak interaction 
between putative binding partners, as well as one or more molecules which mediate 
interaction between molecules which do not normally interact with each other. In addition, 

30 substances which influence an interaction between putative binding partners can include 

those which directly or indirectly affect an upstream event which results in association 
between the putative binding partners. For example, if phosphorylation of one of the 
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putative binding partners endows it with the capacity to associate with another of the 
putative binding partners; substances which influence the interaction of the putative 
binding partners include those which directly or indirectly affect a kinase activity. 

Assays can be developed as disclosed herein to examine the effect on 
intermolecular interactions of a variety of compositions including drugs such as antipyretic 
and anti-inflammatory drugs, analgesics, antiarthritics, antispasmodics, antidepressants, 
antipsychotics, tranquilizers, antianxiety drugs, narcotic antagonists, antiparkinsonism 
agents, cholinergic antagonists, chemotherapeutic agents, immunosuppressive agents, 
antiviral agents, parasiticides, appetite suppressants, antiemetics, antihistamines, 
antimigraine agents, coronary vasodilators, cerebral vasodilators, peripheral vasodilators, 
hormonal agents, contraceptives, antithrombotic agents, diuretics, antihypertensive agents, 
cardiovascular drugs, opioids, and vitamins. 

Protein-protein interactions mediated by a third molecule can be detected and 
quantitated. The kinetics of binding also can be studied. An example of such a system is 
described in Examples 1 and 2 below, wherein P-gal fusion proteins are used to monitor 
the rapamycin-mediated interaction of the FKBP12 and FRAP proteins. Belshaw, P. J. et 
a!.,Proc. Natl Acad. Set USA, 93: 4604-4607 (1996); Brown et ah, Nature 369: 756-758 
(1994); Chen, et al y Proc. Natl Acad Set, USA, 92:4947-4951 (1995); and Choi, J. et al, 
Science, 273:239-242 (1996). For example, kinetics of binding can be determined by 
measuring P-gal activity at different times following addition of rapamycin to cultures of 
cells expressing fusions of FKBP12 and FRAP to two complementing, low affinity P-gal 
mutants {e.g., Aa and Ago). A dose-response curve can also be obtained, in which the 
extent of binding, as measured by p-gal activity, is determined as a function of rapamycin 
concentration. 

This assay can be adapted to control for the possible effect of a protein component 
on its fusion partner, thereby enabling the study of protein-protein interactions in a 
quantitative fashion. In one such control system, tripartite fusion constructs including a 
reporter subunit, a binding protein and the protein of interest are provided. As described 
below in Example 3, in one embodiment, the fusion protein includes 1) a P-gal mutant 
portion, 2) a FKBP12 or FRAP portion, and 3) a test protein portion. The most N-terminal 
component is the test protein, followed by FKBP12-Aco or FRAP-Aa. The presence of 
FKBP12 and FRAP in these constructs allows rapamycin-mediated dimerization of the 
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fusion proteins. The absolute values of P-gal activity obtained by simple co-expression of 
a fusion containing a test protein of interest and fusions containing different potential 
interacting partners is determined. In parallel samples, p-gal activity is measured upon 
induction of complementation with a fixed amount of rapamycin. The ratio of P-gal 
5 activity obtained in the absence and the presence of rapamycin indicates the relative 

abilities of the different protein pairs to interact with each other. 

A further advantage of the tripartite fusion system is that the presence of the 
FKBP12 and FRAP components provides a flexible hinge domain between the p-gal 
mutants and the test protein. This reduces the possibility of interference between the P-gal 

10 component and the test protein. Furthermore, it allows direct testing of the functional 

integrity of the P-gal components in the fusions without the need for recloning into more 
efficient viral vectors. For example, the tetracycline repressor, tetR y forms homodimers in 
mammalian cells with high efficiency. Hinrichs et al (1994) Science 264:418-420. 
Coexpression of ted?-FKBP12-Aco and teffl-FRAP-Aa fusions yielded p-gal-positive cells, 

15 as shown in Example 3, showing that it is possible to construct functional tripartite fusions, 

in which dimerization of the N-terminal peptide component efficiently drives 
complementation of the C-terminal mutant p-gal polypeptides, with the FKBP12 and 
FRAP components serving as both internal standards and flexible hinges. 

The system may be further tested and compared by constructing fusions between 

20 each p-gal mutant and the complete coding sequence of MEF2c. Since MEF2c is known 

to form homodimers in vivo, coexpression of both P-gal mutants fused to MEF2c should 
result in readily detectable enzymatic activity. MEF2c mutants that are impaired in their 
dimerization potential are available and fusion of one of the mutants to one of the P-gal 
mutants can serve as a negative control to further validate the system. Molkentin, et al y 

25 Mol Cell Biol, 16:2627-2636 (1996). 

The reporter system can also be designed with controls to permit the quantitation of 
the expression level of the P-gal fusion proteins. This will make it possible to control for 
potential differential expression of the two (or more) fusion proteins. For example, a 
peptide tag for which well-characterized monoclonal antibodies are available may be fused 

30 in frame at the C-terminus of each p-gal mutant. Different tags, such as flag and myc may 

be used for Aa and Aa>, to allow differential detection of the two mutants even when 
coexpressed in the same cells. In parallel with the determination of P-gal activity in the 
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lysates of these cells, an ELISA assay can determine the precise amount of each P-gal 
fusion protein in the same lysates. First, a polyclonal anti- P-gal antiserum may be used to 
immobilii^the antigens. Then the monoclonal antibody directed against the appropriate 
tag follows*! by an enzyme-linked anti-mouse secondary antibody may be used to quantify 
the amount of the P-gal fusion protein of interest. Such an approach, employing well- 
characterized techniques, should allow a determination of the expression levels of each 
fusion protein. This modification will be useful where the attached tag does not impair the 
binding of the protein or the ability of the reporter subunits to complement. 
Applications of the Invention 

As will be apparent to one of skill in the art, the invention allows, for the first time, 
a broad range of studies of multiprotein and other types of multi-molecular interaction to 
be carried out quantitatively or qualitatively in living cells. In what follows, non-limiting 
examples of different applications of the invention are provided. 

The observation that levels of p-gal activity in the presence and absence of forced 
complementation can be distinguished, both biochemically (Figure 5) and by FACS 
(Example 10 and Figure 6), suggests that the methods of the invention can be used to 
screen for new binding partners) for a given target protein. In this embodiment, the target 
protein, fused to a weakly-complementing P-gal mutant is stably expressed in a well- 
characterized cell line. Expression libraries containing cDNAs fused to a weakly- 
complementing p-gal mutant are introduced into these cells using, for example, retroviral 
vectors (e.g., Kitamura et al., Proc Natl Acad. Set USA 92:9146-9150 (1995) ) or any 
pther means of gene transfer known in the art. Vectors expressing gene products that 
interact with the target protein are isolated by identifying p-gal-positive clones. An 
advantage of this system is that the screen can be carried out in any cell type, regardless of 
the cell's milieu of endogenous (and potentially competing) proteins. A further possibility 
for this type of system is that the target protein can be localized to a specific cellular 
compartment, with the aim of identifying proteins involved in interactions restricted to that 
particular location. 

The use of fluorescence-activated cell sorting techniques is particularly well-suited 
to this embodiment of the invention. For example, P-gal-positive cells which contain 
cDNAs expressing gene products that interact with the target protein will generate a signal 
that will allow such cells to be purified by cell-sorting techniques. Such cDNAs could be 
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delivered, for example, using retroviral vectors that allow introduction of high complexity 
cDNA libraries with high infection efficiency. 

The assays and methods of the invention can also be carried out in the presence of 
extracellular signaling molecules, growth factors or differentiation factors, peptides, drugs 
or synthetic analogs, or the like, whose presence or effects might alter the potential for 
interaction between two or more given proteins in a particular cell type. 

Detection of molecular interactions, using the methods and compositions of the 
invention, is not limited to those occurring in the nucleus, nor is it limited to intracellular 
interactions. For instance, interactions involving surface receptors can be detected in the 
practice of the invention. In one embodiment, the invention provides new techniques for 
detecting ligand-induced dimerization of surface receptors in living cells. Dimerization, or 
higher order oligomerization, of cell surface receptors is often a prerequisite for receptor 
activation and ensuing signal transduction. For example, the binding of epidermal growth 
factor (EGF) to its receptor stabilizes the dimerization of the receptor and leads to 
activation of its tyrosine kinase activity. Schlessinger et al (1 992) Neuron 9:383-391 ; 
Ullrich et al (1990) Cell 61:203-212; and Weiss et al (1997) Curr. Opin. Genet Dev. 
7:80-86. Example 1 1, infra, discloses the use of p-gal complementation to monitor 
membrane receptor dimerization in living cells. For this purpose, the weakly 
complementing Act and Aco deletion mutants of P-gal were fused to the extracellular and 
transmembrane regions of the human EGF receptor to form a chimeric receptor molecule 
(see Figure 7A). Deletion of the cytoplasmic domain of the receptor prevents the 
internalization and degradation of the receptor that is normally observed following EGF 
stimulation of cells (Livneh et al (1986) 1 Biol Chem. 261:12490-12497), permitting an 
analysis of receptor dimerization over time in changing conditions. The results presented 
in Example 1 1 demonstrate that this embodiment of the invention can be used to detect a 
previously-wirecognized mode of regulation of EGF receptor signaling, in which EGF 
receptor tyrosine kinase activity acts as a feedback inhibitor limiting the dimerization of 
the receptor. 

The practice of the invention is not limited to detection of interaction between two 
different molecules. Multimerization of a molecule can also be detected using the methods 
and compositions of the invention. In this regard, Example 1 1 discloses the detection of 
receptor dimerization through the practice of the invention. 
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By combining the methods and compositions of the invention with state-of-the-art 
methods for construction of high-titer, high-complexity cDNA libraries in retroviruses 
(e.g., Pear et ai, (1993) Proc. Natl. Acad Sci. USA £0:8392-8396), it will be possible to 
identify interaction partners of a specific test protein in mammalian cells (i.e., perform 
5 functional genomics at the protein level). For this application, construction of cDNA 

libraries in retroviral vectors wherein the cDNA coding sequence is fused to a sequence 
encoding a low affinity reporter subunit will be used. A sequence encoding a binding 
protein of interest will be fused to a low affinity reporter subunit in a first retroviral vector. 
In a second series of retroviral vectors, a second complementing low affinity reporter 

10 subunit will be fused to a variety of different proteins that will be tested for their ability to 

bind to the protein of interest. Testing will be conducted by co-infection of cells with the 
first and one of the series of second retroviral vectors. Those test proteins which are 
capable of binding to the protein of interest will allow detection of a reporter signal in cells 
in which they are co-expressed with the protein of interest. This application will also be 

15 useful in screening for agonists and antagonists of medically-relevant protein interactions. 

In one embodiment of the invention, cells in which a protein encoded by one of the 
series of second vectors is able to interact with the binding protein of interest encoded by 
the first vector are detected and isolated by flow cytometry or fluorescence-activated cell 
sorting (FACS). Methods for flow cytometry and FACS are well-known in the art; e.g., 

20 Nolan et al. (1988) Proc. Natl. Acad. Sci. USA 85:2603-2607; Webster et al., Exp. Cell 

Research 174:252-265 (1988); and Parks et al. (1986) in The Handbook of Experimental 
Immunology, (eds. Weir, D.M., Herzenberg, L.A., Blackwell, C.C. & Herzenberg, L.A.), 
Blackwell, Edinburgh, 4th edition, pp. 29.1-29.21. In this way, clones of cells in which 
binding occurs can be isolated and propagated for further study. This aspect is particularly 

25 suited for studies of developmental mechanisms, wherein it is possible to select a 

population of cells in which a particular developmentally-relevant interaction has occurred 
and study the further development of that cell population, while at the same time, studying 
the further development of cells in which the interaction has not occurred, for comparison. 
In a similar fashion, the practice of the invention makes it possible to isolate and/or study 

30 the further development of cells exhibiting interactions involving protein such as 

transcriptional regulatory proteins, translational regulatory proteins, DNA replication 
proteins, mRNA splicing proteins, proteins involved in signal transduction, proteins 
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involved in cell-cell and cell-substrate adhesion (for example, cell movement, axon 
guidance and angiogenesis), oncogene products, tumor suppressors, proteins involved in 
cell-cycle control and viral proteins, such as those involved in regulation of viral 
replication, virus-host interactions and virus assembly, and proteins which are subunits, 
5 crosslinkers, modifying agents or molecular motors within the cytoskeleton of cells. 

For a given target protein whose gene is capable of being fused to a low-affinity 
complementing reporter subunit, it is possible to identify known and heretofore unknown 
proteins or other endogenous or extraneous substances with which it interacts, by using the 
compositions and methods of the invention. In like manner, for a sequence which encodes 

10 a protein of unknown function, such as may be obtained from a nucleic acid sequence 

database, (or a plurality of sequences such as a cDN A library) the practice of the invention 
allows one to identify molecules with which the encoded protein interacts. The identity of 
the interacting molecule(s) is likely to provide information with respect to the structure 
and/or function of the unknown protein. Thus, the practice of the invention will likely aid 

15 in the identification and characterization of newly-discovered proteins and protein-coding 

nucleic acid sequences. 

In another aspect of the invention, a shotgun approach to the identification of 
protein-protein interactions can be taken by generating a first set of constructs which will 
express the encoded products of one cDNA library fused to a first low-affinity 

20 complementing subunit and a second set of constructs which will express the encoded 

products of a second (or the same) cDNA library, fused to a second low-affinity 
complementing subunit. Co-expression of the two sets of constructs and selection of cells 
in which complementation occurs will allow the isolation of clones and the identification 
of cDNAs which encode interacting partners. One or both of the interacting partners may 

25 be known; alternatively, both of the interacting partners may represent heretofore 

unidentified proteins. If both partners are known, new information about their binding 
specificity may be obtained. If one partner is known, it may provide information on the 
function of the unknown binding partner. If neither are known, the observation that they 
interact may assist in the eventual identification of one or both of the interacting pair. 

30 The invention may be applied to studies of the mechanisms that regulate either 

homo- or hetero-dimerization or multimerization of specific molecules, including high 
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efficiency screening to identify synthetic or naturally occurring compounds capable of 
influencing such dimerization. 

The invention can be used for investigations relating to the localization of specific 
complexes within intact cells, or intact animals. Types of cells which can be used are 
5 primary or established cell lines and other types of embryonic, neonatal or adult cells, or 

transformed cells (for example, spontaneously- or virally-transformed). These include, but 
are not limited to fibroblasts, macrophages, myoblasts, osteoclasts, osteoclasts, 
hematopoietic cells, neurons, glial cells, primary B- and T-cells, B- and T-cell lines, 
chondrocytes, keratinocytes, adipocytes and hepatocytes. 

10 It is also possible, through practice of the invention, to devise systems for 

regulation of enzyme activity by regulating the association of complementing mutants. 
This aspect of the invention has potential applications to human therapy, as a method to 
regulate the enzyme-driven conversion of pro-drugs into their active forms. 

Processes involving molecular interactions, particularly protein-protein 

15 interactions, which can be studied in the practice of the invention include, but are not 

limited to, transcription, translation, replication, mitosis, growth control, progression and 
regulation of the cell-cycle, apoptosis, cell-cell, cell-substratum and cell-ligand 
interactions, intracellular signal transduction cascades, oncogenesis, cell lineages, and 
embryonic development. Examples of cell ligands include leptin and growth factors such 

20 as epidermal growth factor (EGF), nerve growth factor (NGF), platelet-derived growth 

factor (PDGF), and insulin-like growth factors I and II (IGF-I and IGF-II), transforming 
growth factors a and P (TGF-a and TGF-p), endorphins and endorphin receptors, 
prostaglandins and their receptors, cytokines and their receptors, neurotransmitters and 
their receptors, adrenergic receptors, and cholinergic receptors. Receptors which could 

25 interact with ligands include EGF, NGF, and PDGF receptors and leptin receptors. 

Analysis of EGF receptor dimerization, using the methods and compositions of the 
invention, is provided in Example 11, infra. 

Additional interactions that can be studied by the practice of the invention include 
interactions involved in cell metabolism and cell structure. These include, but are not 

30 limited to, interactions that are involved in energy metabolism or which establish or 

modify the structure of the membranes, cytoplasm, cytoskeleton, organelles, nuclei, 
nuclear matrix or chromosomes of cells. Interactions among constituents of the 
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extracellular matrix, or between constituents of the extracellular matrix and cells, can also 
be studied with the methods and compositions of the invention. 

Tfifeinvention will be further understood by the following non-limiting examples. 

EXAMPLES 

Example 1: Preparation and Transfection of Retroviral Construct Encoding a ft- 
Galactosidase Reporter System. 

A reporter system using p-galactosidase ("P-gal") complementation to evaluate 
protein-protein interactions was constructed. A well-characterized protein complex 
developed by Schreiber was used as a test system to provide the protein binding moieties. 
Belshaw, P. J. et al y Proc. Natl Acad Scl USA, 93 : 4604-4607 (1 996); Brown et al., 
Nature 369: 756-758 (1994); Chen, et al,Proc. Natl Acad Scl, USA, 92:4947-4951 
(1995); and Choi, J. et al, Science, 273:239-242 (1996), the disclosures of which are 
incorporated herein. In this protein complex, the intracellular rapamycin binding protein, 
FK506-binding protein-12 (FKBP12), interacts with intracellular FKBP-rapamycin 
associated protein (FRAP) only when rapamycin is present in the culture medium, an 
interaction that increases with the dose of rapamycin. Rapamycin is a small, cell-permeable 
molecule that binds to the two intracellular proteins via independent determinants. Since 
rapamycin is unable to bind two FKBP12 molecules at the same time and FRAP only 
binds rapamycin within the FKBP12-rapamycin complex, only heterodimers should form 
upon rapamycin treatment. Ho, S. N. et al, Nature, 382:822-826 (1996), the disclosure of 
.which is incorporated herein. 

The p-gal system was combined with the FKBP 1 2/FRAP/rapamycin system as 
follows. Two different retroviral constructs were produced, each encoding fusion proteins 
of the Acs) or Act P-gal mutants, and either FKBP 12 or the FKBP-rapamycin binding 
domain of FRAP, respectively (FKBP 12- A© and FRAP-Aa). 

The Aa or Aco p-gal mutants were obtained as described in Mohler and Blau, Proc. 
Natl Acad Scl, 93:12423-12427 (1996), the disclosure of which is incorporated herein. 

To fuse the sequences coding for FKBP 12 and the FKBP12-rapamycin binding 
domain in frame with P-gal, an adapter oligonucleotide (C ATGGAGCTCGAGAG) 
containing an Xhol site was inserted in the Ncol site at the ATG of the Aa and A© p-gal 
mutants described by Mohler and Blau, supra. Two Xhol-Sall DNA fragments 
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corresponding to amino acids 2025-21 14 of human FRAP and to the complete coding 
sequence of human FKBP12 were cloned in the Xhol site of the modified Aa and Ao> 
mutants, generating FRAP-Aa and FKBP12-Aa>. Conservation of the appropriate reading 
frame was confirmed by sequencing for both constructs. 

To insert the FRAP-Aa and FKBP12-Aco coding sequences in the pWZL-Neo and 
pWZL-Hygro retroviruses, an adapter oligonucleotide containing Ncol and BamHI sites 
(GATCACCATGGACGCGTGGATCCC) was inserted in the BamHI and Xhol sites of 
the pWZL vectors. Both the original sites were destroyed by this insertion. The FRAP-Aa 
and FKBP12-Aco coding sequences were then inserted in the modified pWZL vectors as 
NcoI-BamHI fragments. 

The cDNAs encoding FKBP12-A© and FRAP-Aa were inserted into a mouse 
ecotropic retroviral vector upstream of the hygromycin resistance or neomycin resistance 
genes, respectively, as described above. By using an Encephalomyocarditis virus internal 
ribosomal entry sequence (IRES), introduction of a single retroviral vector ensured 
production of a bicistronic mRNA and translation of both the Aa -p-gal-FRAP-protein and 
the drug selectable hygromycin protein. A second retroviral vector yielded the Aco-p-gal- 
FKBP12 protein and neomycin resistance protein. 

For virus production and infection, proviral constructs were introduced into 
packaging cells by calcium phosphate transfection. The supernatant media containing 
retrovirus from the packaging cells was harvested 24 to 72 hours after transfection and 
used to infect C2C12 cells in the presence of 8 jag/mL polybrene. Singly and doubly 
infected cells were selected with the appropriate drugs. Both Geneticin and Hygromycin 
were used at a final concentration of 1 mg/ml. The selected cells were expanded as 
populations for subsequent experiments. 

Although the background (3-gal detected with the Aa and Ago mutants expressed 
from MFG retroviral vectors described previously (Dhawan et al, Science, 254: 1509-1512 
(1991) was relatively low (Mohler, W. A., & Blau, H. M, Proc. Natl Acad Set USA, 
93:12423-12427 (1996), the disclosure of which is incorporated herein), it was further 
reduced by using retroviral vectors with point mutations that deleted the splice 
donor/acceptor sequences upstream of the p-gal ATG (pWZL). These mutations result in a 
lower translation efficiency of the first coding sequence contained in the vector, but do not 
affect the expression of the selectable marker, which is solely dependent on the IRES. 
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Using this vector, two-fold less of the upstream protein was expressed compared to vectors 
containing the same LTRs (long terminal repeats) and the wild-type splice donor/acceptor 
sequences. Such a reduction in translation reduces the concentration of the fusion protein 
and consequent spontaneous interactions of p-gal mutants irrespective of the test proteins 
5 to which they are fused. As a result, in preliminary experiments, the background enzyme 

activity measured by luminometer for Aa and Acq p-gal mutants alone was reduced from 
low to essentially undetectable. 

Infectious viral particles were produced by transient transfection of each construct 
shown in Figure 2a into a packaging cell line modified from that described by Pear et ai, 

10 (1993) Proc. Nail. Acad ScL USA 90:8392-8396 by calcium phosphate transfection. The 

supernatant media containing retrovirus from the packaging cells was harvested 24 to 72 
hours after transfection and used to infect C2C12 cells in the presence of 8 ng/mL 
polybrene. C2C12 myoblasts were infected either singly with each retrovirus alone or 
simultaneously with both. All experiments were performed after selection with 

15 hygromycin and G418 to ensure that 100% of the cells contained both constructs. Both 

Geneticin and hygromycin were used at a final concentration of lmg/ml. The selected 
cells were expanded as populations for subsequent experiments. 

Example 2 : Assays of Binding and Activity of the P-Galactosidase Reporter 
System. 

20 Following the addition of rapamycin to the media, the transfected cells obtained as 

described in Example 1 were assayed for P-gal activity. As shown in Figure 3, C2C12cells 
expressing both FKBP12-A(0 and FRAP-Aa were tested by exposure to 10 ng/ml 
rapamycin (Figure 3b) for 12 hr or to no drug at all (Figure 3a). Only those cells 
expressing both constructs exhibited p-gal activity, readily visualized by X-gal staining of 

25 fixed cells (Figure 3b). It is advantageous that cytoplasmic staining is detectable with this 

method, in contrast to prior methods such as the yeast two-hybrid system, which report 
only nuclear interactions. X-gal staining was conducted as follows: Cells were fixed 5 
minutes in PBS plus 4% paraformaldehyde and rinsed in PBS prior to staining. 
Indigogenic X-gal staining was performed overnight at 37°C in PBS plus 1 mg/mL X-gal, 

30 1 mM MgCl 2 , 5 mM K 3 Fe(CN) 6 , 5 mM K4Fe(CN) 6 . 

The kinetics of activation of P-gal upon rapamycin treatment were determined. 
C2C12 cells expressing both fusion proteins were plated in replicate in 96 well plates. 
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Rapamycin was added to the culture medium, and the p-gal activity measured at different 
time points. For each time point, six replicate samples were assayed with a sensitive 
chemiluminescence assay, as described in Mohler, W. A., & Blau, H. M., Proc. Natl 
Acad Set, USA, 93:12423-12427 (1996), the disclosure of which is incorporated herein. 
In the assay, cells cultured in microtiter plates were lysed in situ in 50 |iL of a 1 :3 mixture 
of lysis and assay buffers containing Galacton Plus substrate from the Galactolight Plus 
assay kit (Tropix, Bedford, MA). Reactions proceeded for 1 hour at room temperature. 
After addition of Light Emission Accelerator solution, luminescence was measured in a 
scintillation counter. 

The results, shown in Figure 4, indicate that the interaction assays for the fusion 
proteins are specific, and exhibit similar kinetics and a comparable dose-response curve to 
assays of the FKBP12/FRAP/rapamycin protein complex alone. Ho, S. N. etal, Nature, 
382:822-826 (1996). Rapamycin induced a 30-fold increase in P-gal activity within 5 
hours. As a control, no rapamycin was added, and no p-gal activity was detected above 
background. As a second control, in cell populations expressing only one of the two 
constructs, p-gal activity did not increase above background when rapamycin was added. 

In Figure 4b, the dose response curve is shown. The activation of P-gal was 
dependent on the dose of rapamycin, which appeared linear over a range of 0 to 10 ng/ml 
of the drug. This linearity provides support that p-gal enzymatic activity can serve as a 
reporter to quantitate protein-protein interactions. The close approximation of both the 
dose response and the kinetics to that observed by others (Ho, S. N. et ai 9 Nature, 
382:822-826 (1996)) suggests that the fusion to p-gal peptides is not interfering with the 
interaction of the FKBP12 and FRAP proteins. Moreover, endogenous FKBP12 and 
FRAP proteins are ubiquitously expressed and are capable of interacting with each other or 
with the fusion proteins in the presence of rapamycin, without generating p-gal activity. 
Detection of P-gal activity, as shown above, indicates that productive FRAP-Aa and 
FKBP12-Aco dimers will form in a cellular environment containing competing endogenous 
proteins, and that the resultant p-gal activity reflects the interaction of FRAP and FKBP12- 
rapamycin Thus, the p-gal fusion proteins can be used to monitor the interaction of 
proteins in the FKBP12/FRAP/rapamycin complex and in other types of multiprotein 
complexes. 
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It is also possible to detect and quantitate binding activity in cell lysates. As shown 
in Figure 5, cells expressing both FKBP12-A© and FRAP-Aa fusion proteins were 
expanded in the absence of rapamycin and lysed. 100 ng/ml rapamycin was added to half 
of the samples, and the p-gal activity in the treated and untreated ly sates was determined 
immediately (white bar), after one hour (black bar) or after 3 hours (gray bar). A greater 
than two-fold increase in p-gal activity was observed in the rapamycin-treated lysates one 
hour after administration of the drug. In control lysates that were not exposed to 
rapamycin, no statistically significant increase in p-gal activity was detected. The ability 
to detect and quantitate protein-protein interactions in cell lysates using the methods and 
compositions of the invention indicates that interactions between mature, fully-folded 
proteins can be detected and quantitated; co-translational assembly of complexes in not 
required for formation of complexes that can be monitored by P-gal activity. 

Example 3 : Tripartite fusions for the quantitation of protein-protein interactions. 

To permit protein interactions to be studied in a quantitative manner in the system 
described in the above Examples and to control for effects on either the binding ability of 
the binding moiety or the complementing ability of the reporter subunits resulting from 
both activities being present in a single fusion protein, additional modifications were made 
to monitor the expression of the components of the system. In the above described system, 
the p-gal fusion proteins will be expressed from the same viral promoter, however, for 
some proteins, it is possible that their expression level will be influenced by the specific 
fusion partner. In particular, some proteins or domains could affect the stability or 
conformation of the p-gal domain. As a result, differences in the ability of the test proteins 
(the putative binding moieties) to complement one another could be observed that are not 
based on a physiological mechanism. 

In order to avoid these problems, fusions containing three components (P-gal 
mutant, FKBP12 or FRAP, and the test protein) were constructed. The most N-terminal 
component is the test protein, followed by FKBP12-Aa> or FRAP-Aa (see the exemplary 
system in Figure 2b, where the test protein portions of the fusion are indicated by X and 
X'). The presence of the FKBP12 and FRAP portions allows rapamycin-mediated 
dimerization of these fusions, and the efficiency of p-gal complementation in the presence 
of rapamycin appears to be dependent on the FKBP12/FRAP/rapamycin interaction. The 
absolute values of p-gal activity obtained by simple coexpression (in the absence of 
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rapamycin) of fusions containing a fixed protein of interest and different interacting 
partners was determined. In parallel samples, p-gal activity was measured upon induction 
of complementation with a fixed amount of rapamycin. The ratio between the P-gal 
activity obtained in the absence or in the presence of rapamycin indicated the relative 
ability of the different protein pairs to interact with each other. An added advantage of this 
approach is that the presence of the FKBP12 and FRAP domains provide a flexible hinge 
between the p-gal mutants and the putative binding moieties that are being analyzed. This 
reduces the possibility of interference between P-gal and the proteins of interest. 
Furthermore, it allows direct testing of the functional integrity of the p-gal components in 
the fusions without the need for recloning into more efficient viral vectors. 

Results were obtained with tetR-FKBP12-Aa> or tetR-FRAP-Aa tripartite fusions 
(see example in Figure 2b). Coexpression of these constructs, in which dimerization is 
driven by the tetracycline repressor (tetR) protein (Hinrichs, W. et al. 9 Science, 264:418- 
420 (1994), the disclosure of which is incorporated herein), readily yielded P-gal positive 
cells. This indicates that functional tripartite fusions can be constructed, in which the 
dimerization of the most N-terminal peptide component efficiently drives complementation 
of the C-terminal p-gal deletion mutant polypeptides. 

Example 4: Dimerization of myogenic regulators using complementing p-g al 
fusion proteins 

The p-gal complementation system is used to assay for the dimerization and 
nuclear translocation of HLH proteins (helix-loop-helix proteins, Murre et al (1989) Cell 
56:777-783) including activators of muscle-specific proteins (myoD, myogenin, myf5, 
MRF-4), inhibitors of myogenesis (Id, Mtwist, I-mf) and ubiquitous E2A-type proteins 
(E47,E12,HEB). 

In a first step, a myoD-Aa-p-gal (myoD-Act) fusion construct and a E12-A<o-p-gal 
(E12-Aa>) fusion construct are engineered in selectable retroviral vectors, as described 
above for FRAP- Act and FKBP12-Aco. The two constructs are transduced into C2C12 
myoblasts. Following selection with the appropriate drugs for cells which express both 
constructs, p-gal activity is quantitated using the chemiluminescent assay described above. 
P-gal activity indicates that heterodimerization of the fusion proteins is occurring in this 
cell type. If P-gal activity is detected, individual cells are analyzed using a fluorescent X- 
gal stain in order to determine if the heterodimers are present in the nucleus. Since wild- 
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type p-gal can be specifically directed to and detected in the nucleus by inclusion of a 
nuclear localization sequence (nls) (Hughes and Blau, Nature, 345:350-352 (1990)), 
activity from the p-gal hybrid protein may be detected in the nucleus. Knowledge of the 
site of localization in the cytoplasm or nucleus will aid in assessing the function of the 
5 protein interactions, e.g. sequestration and inhibiting activity, or promoting activity. This 

method permits visualization of fluorescent markers of myogenesis, such as desmin, and 
creatine kinase, in correlation with the localization of P-gal, using the sensitive Fluor-X- 
Gal substrate described above (Mohler, W. A., & Blau, H. M., Proc. Natl Acad ScL, 
USA, 93:12423-12427 (1996)). 

10 All fusion constructs between myogenic regulators and complementing p-gal 

mutants described in the following sections may be tested in a muscle cell where 
heterodimerization of the endogenous myogenic regulator is known to occur. In addition, 
the following controls also may be performed. The myoD-Aa construct may be 
contransduced into the cell with FKBP12-Ao), and the E12-Aco construct may be 

15 cotransduced with FRAP-Aa. This combination of constructs should result in no P-gal 

activity, unless some unusual mechanism exists in the particular cell type being tested that 
enhances complementation of the weakly complementing P-gal peptides independent of 
heterodimerization of the non-P-gal parts of the molecule. The FRAP-Aa and FKBP12- 
A© may also be cotransduced and cells treated with rapamycin as a positive control for 

20 complementation in each cell type. Cells in high serum medium (growth medium) and 

cells in low serum medium (differentiation medium) should/will give different results. 

Example 5: In vivo assay for the effect of growth factors and substrates on 
heterodimerization and homodimerization. 

Using the constructs described above in Example 4, C2C12 myoblasts are 

25 transduced with one of the myogenic HLH fusion constructs and the El 2- A© construct. 

Although C2C12 cells will already contain endogenous myogenic HLH proteins and El 2, 
the chimeric constructs will act as a. "tracer" to measure the extent of heterodimerization. 
Transduced cells then may be stimulated to either differentiate or proliferate by changes in 
serum levels or the addition of growth factors (TGF-p, bFGF, IGF-I and IGF-II) in the 

30 presence or absence of substrates such as fibronectin or laminin. P-gal activity then is 

measured as a function of time. Rapid changes in p-gal activity after growth factor 
stimulation may suggest a more direct mechanism of action of a given extracellular signal 
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on the formation of specific heterodimers. Slower changes may indicate that the 
extracellular signal acts indirectly, for example by up-regulating the expression of a 
competing factor which can sequester one or both fusion proteins. Changes in 0-gal 
activity may be correlated with the expression levels of known inhibitors of differentiation 

5 such as Id proteins, measured by Northern blot in parallel samples. A comparison of the 

kinetics of changes in p-gal activity obtained with each pair of test proteins in parallel 
experiments will indicate whether specific MRFs (muscle regulatory factors, Yun et al 
(1996) Curr. Opiru Cell Biol 8:877-879; and Cossu et al (1996) Trends Genet, 12:218- 
223) or inhibitors differ in their ability to respond to extracellular signals. When a growth 

10 factor or substrate capable of influencing heterodimer formation (or nuclear translocation) 

is identified, the experiments are repeated in other, non-myogenic cell types. The analysis 
of the effect of a specific growth factor in different cell types indicates whether the 
intracellular components of the corresponding signal transduction pathway are tissue- 
specific. 

15 These studies in tissue culture cells permit the relative affinity and 

compartmentalization of specific protein partners under conditions of growth and 
differentiation, and subsequently in response to known signal transducers, to be evaluated. 
The interactions of these factors may be tested in a relevant physiological background in 
competition with the prevalent endogenous components present in the cell at the time. 

20 Most analyses of the interactions of myogenic factors performed thus far have been carried 

out in vitro, in purified systems, or in yeast (Benezra et al, Cell, 61:1213-1230 (1990); 
Lassar et al, Cell, 66:305-315 (1991); Hu et al, Mol Cell Biol, 12:1031-1042 (1992); 
Chen et al, Cell, 86:731-741 (1996); and Spicer et al, Science, 272:1476-1480 (1996). 
The relatively low sensitivity of the biochemical methods used to directly detect 

25 interactions in mammalian cells, such as immunoprecipitation or activation of a reporter 

gene construct, required high levels of protein and overexpression of the construct, usually 
obtained by transient transfection, levels that could potentially force an interaction due to 
increased concentration. The methods disclosed herein permit protein-protein interactions 
that are functionally relevant at different points in the myogenic differentiation pathway to 

30 be studied. Clearly, the extracellular and intracellular milieu determines the stoichiometry 

and abundance of the these proteins at different times. As a result, competition of different 
proteins for the same dimerization partners, cofactors, and kinases or phosphatases in 
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signal transduction pathways could have significant effects on which complexes actually 
form in intact cells. To assess the nature of such endogenous interactions, low expression 
levels are needed in order not to alter the levels inherent to the cell and characteristic of the 
"competitive" environment at a given time. Advantageously, high-level expression of the 
5 introduced proteins is not required in the systems described herein in order to assess the 

protein-protein interactions of interest. Indeed, by contrast with transient transfection 
assays or even most retroviral vectors with strong promoters and high translation 
efficiencies, the systems disclosed herein provide levels that should not perturb the natural 
endogenous physiological levels of the proposed test proteins in the cell. 

10 Example 6 : Analysis of inhibitory and myogenic HLH proteins in mice. 

The heterodimerization of inhibitory and myogenic HLH proteins in mice may be 
mapped. Mtwist and I-mf have been shown to inhibit myogenesis in mammalian tissue 
culture systems. In addition, they have been proposed to act via direct physical association 
with myogenic HLH proteins (Hebrok et al, Dev. Biol, 165:537-544 (1994); Rohwedel et 

15 al, Exp. Cell Res., 220:92-100 (1995); Chen et al, Cell, 86:731-741 (1996); Spicer et al, 

Science, 272:1476-1480 (1996)). During embryogenesis, Mtwist is expressed throughout 
the epithelial somite and is later excluded from the myotome (Fuchtbauer, Dev. Dyn., 
204:316-322 (1995); and Stoetzel et al, Mech. Dev. 51 :251-263 (1995)). Although I-mf 
expression has not been analyzed at early stages of somitogenesis, at 1 1.5 days post-coitum 

20 I-mf is highly expressed in the sclerotome but is excluded from the myotome (Chen et al, 

Cell, 86:73 1-741 (1996)). Thus, based on their expression domains in the embryo, these 
factors are thought to be critical for spatial and temporal restriction of the myogenic 
program in early development. 

Further support for this hypothesis derives from analyses of myfS/lacZ embryos in 

25 which the myfS coding region has been targeted and replaced by lacZ. Using p-gal as a 

marker of the myf5 expression pattern, cells expressing myf5 are detected in the presomitic 
mesoderm, where Mtwist is also expressed (Fuchtbauer, Dev. Dyn., 204:316-322 (1995); 
and Stoetzel et al, Mech. Dev. 5125 1-263 (1995)), long before the onset of myogenesis 
(Cossu et al, Trends Genet., ]2:21 8-223 (1996)). Later in development, myf5 and myoD 

30 are co-expressed together with Mtwist in the somite before the formation of a distinct 

myotome. Ott, et al, Development lUr. 1097-1 107 (1991); Fuchtbauer, Dev. Dyn., 
204:316-322 (1995); Stoetzel et al, Mech Dev. 51:251-263 (1995); and Cossu et al, 
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Trends Genet, 12:218-223 (1996)). These cells do not express other detectable myogenic 
markers (Ott, et al., 1991). Thus, the reporter systems disclosed herein may be used to 
determine if the myf5 and MyoD proteins in these cells are maintained in an inactive state 
by interaction with Mtwist and/or I-mf in heterodimers. At subsequent stages of 
development, Mtwist and I-mf are expressed in most of the non-myogenic mesoderm, 
where the expression of myogenic factors is excluded. Smith et al, J. Cell Biol, 127:95- 
105 (1994); Fuchtbauer, Dev. Dyn., 204:316-322 (1995); Stoetzel et al, Meek Dev. 
51:251-263 (1995); and Chen et al, Cell, 86:731-741 (1996). Possibly Mtwist and I-mf 
are involved in the creation of a sharp border between the myotome and the adjacent 
tissues at this stage. 

The reporter systems disclosed herein permit detailed studies of the interactions 
between myogenic inhibitors and activators in vivo during embryonic development which 
can provide novel insights into the complex process of patterning during somitogenesis. 
Such studies are not limited to mice and can easily be performed in C. elegans, 
Drosophila, Xenopus, zebrafish and other experimental organisms. To date, a 
methodology that allows visualization of protein complexes in situ in the embryo has not 
been available. As a result, no definitive evidence is available as to when and where 
during embryonic development interactions of such HLH heterodimers might occur. 
Example 7: Detection of HLH heterodimers in mouse embryos 
The p-gal complementation assay is well-suited for the detection of protein-protein 
interactions in vivo. Myf5-Aa, MyoD-Aa and Mtwist-Aco fusion proteins may be 
constructed. Mediation of P-gal complementation with these fusion proteins may be tested 
in the course of performing the experiments described above. Using well-established 
transgenic technology (Thomas and Capecchi, Nature, 324:34-38 (1986); and Capecchi, 
Science, 244: 1288-1292 (1989)), mouse lines may be generated in which one of the myf5, 
MyoD or Mtwist alleles has been replaced with the corresponding fusion protein. Thus 
myf5-Aa, MyoD-Aa and Mtwist-Aco fusion proteins will be expressed under the control of 
their endogenous promoters. The expression of the test protein can be verified in these 
mice. The Mtwist-Ao transgenic mouse may then be crossed with the myf5-A<x, and the 
MyoD-Aa transgenic mouse lines, and in each case the offspring may be analyzed in order 
to identify those carrying both of the fusion proteins. P-gal activity should only develop in 
those cells of the embryo in which Mtwist-Aco physically associates with the myf5-Aa or 
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the MyoD-Aa fusion proteins. This analysis allows mapping when and where during 
embryonic development Mtwist is actually interacting with myf5 and MyoD to repress the 
myogenic phenotype. 

Example 8: Targeting strategy and engineering of necessary constructs 
The myf5-Aot fusion protein coding sequence may be inserted into the myf5 locus 
so that it will be expressed under the control of the endogenous myf5 regulatory elements. 
A similar insertion of wild type P-gal in the myf5 locus resulting in a fusion with the ATG 
of myf5 has been shown to reproduce faithfully the expression pattern of the endogenous 
gene. The targeting construct is based on the published my fS/lacZ targeting construct 
(Tajbakhsh and Buckingham, Proc. Natl Acad. Sci. USA, 91:747-751 (1994); Tajbakhsh 
et aL 9 Neuron, 13:813-821 (1994); and Tajbakhsh et aL 9 Nature 384:266-270 (1996)), but 
with the following differences: (1) The fusion protein contains the complete myf5 coding 
sequence fused to the Aa P-gal. (2) The fusion protein coding sequence is followed by a 
neomycin resistance gene flanked by FRT sites (FLP recombinase targets). This allows 
G4 1 8 selection of ES ceils that have taken up and integrated the targeting construct. (3) A 
diphtheria toxin expression cassette is located 5' of the region of homology with the myf5 
mouse genomic DNA. During homologous recombination, strand exchange will occur 
within the homology region and as a result the diphtheria toxin expression cassette will be 
excluded following integration (Capecchi, Science, 244: 1288-1292 (1989)). Clones 
resulting from random integration rather than homologous recombination retain diphtheria 
toxin expression and will be selected against during culture, because they will die. The 
surviving clones are characterized by PCR, and the appropriate integration of the construct 
in the myf5 genomic locus is confirmed by Southern blot. 

Subsequently, the neomycin selection cassette is removed using a modified version 
of a previously described technique (Fiering et ai 9 Genes Dev., 9:2203-2213 (1995)). 
Briefly, a plasmid expressing a bicistronic message containing FLP recombinase, an 
Internal Ribosomal Entry Site (IRES) and GFP is transiently transfected into the ES cell 
clones. GFP positive cells are clonally sorted using the fluorescence activated cell sorter 
(FACS). In these cells, FLP deletes the sequences between the two FRT sites, and only the 
p-gal coding sequence remains in the ES cell genome. Aliquots of the sorted clones are 
tested for sensitivity to G41 8, and in the sensitive clones the accurate deletion of the 
neomycin cassette is confirmed by PGR and Southern blotting. This approach, which 
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eliminates the selectable marker, avoids interference between the exogenous promoter 
driving the selectable marker and the endogenous regulatory sequences as described 
(Olson etaL, Cell, 85:1-4 (1996)). 

Targeting constructs for MyoD and Mtwist have also been described (Rudnicki et 
al, Cell, 71 :383-390 (1992); Chen and Behringer, Genes Devel , 9:686-699 (1995)) and 
the relevant constructs may be produced for each. Based on these available reagents, and 
following the scheme proposed above for the myf5«Aa strategy, vectors to target (Chen 
and Behringer, Genes Devel, 9:686-699 (1995)) MyoD-Aa and Mtwist-Aco fusions into 
the endogenous MyoD and Mtwist loci of ES cells may be constructed. In each case, an 
ES cell line syngeneic to the available genomic DNA homology regions in the targeting 
construct are used, as strain differences are known to reduce the frequency of homologous 
recombination. The same FLP-mediated excision methodology used for the myfS "knock 
in" described above is applied to the deletion of the neomycin resistance markers from the 
targeted MyoD and Mtwist loci. This "in-out" strategy ensures that the fusion protein 
coding regions are under the control of the endogenous regulatory elements and associated 
with minimal extraneous flanking DNA sequences. 

Example 9: Analysis of the mvfS-Aa/Mtwist-Aco and 
MvoD-Aa/Mtwist-Aco transgenic lines 

For each construct, multiple ES cell clones are injected into blastocysts. The 
chimeric offspring obtained upon implantation into pseudopregnant females are tested for 
germline transmission, and heterozygous mice are obtained. One critical control in this 
experiment is to confirm that the expression pattern of the "knocked-in" fusion proteins 
faithfully mimics that reported for the corresponding endogenous factors. For this 
purpose, a system allowing rapid detection of the fusion proteins is provided. A transgenic 
mouse strain expressing a p-gal mutant (A^i) capable of strong complementation with 
either Aa or A<p is generated. An is expressed ubiquitously from the strong chicken p- 
actin promoter. MyoD-Aa, myf5-Aa and Mtwist-Aco transgenic mouse lines are each 
crossed with the An transgenic mice. Since co-expression of any of these fusion proteins 
with the strongly complementing Afi mutant should result in readily detectable P-gal 
activity, it is thus possible to follow the expression pattern of our fusion proteins by X-gal 
staining of the embryos. 
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The Mtwist-Aa> mouse line is crossed with MyoD-Aa and myf5-Aa transgenic 
mouse lines. As heterozygous mice are used for these crosses, on average 1/4 of the 
embryos will be double heterozygotes. These embryos are analyzed at different time 
points during development by staining whole mount preparations and histological sections 
5 with X-gal. The sections also are stained with the more sensitive Fluor-X-Gal fluorescent 

substrate (Mohler, W. A., & Blau, H. M, Proc. Natl Acad. ScL, USA, 93:12423-12427 
(1996)), to detect those cells in which the Mtwist-MyoD or the Mtwist-myf5 interaction is 
a rare event and the P-gal signal is consequently lower. 

The strength of this approach is that P-gal activity should only appear in cells in 
10 which the interactions described above take place in vivo. This approach allows a 

thorough analysis of the interplay between inhibitors and activators of myogenesis during 
development. In particular, it allows analysis of the correlation between co-expression and 
a physical interaction of two proteins as heterodimers in an in vivo setting, the developing 
mouse embryo. This is particularly important in the case of factors which, like Mtwist, are 
15 known to be involved in multiple control steps (Chen and Behringer, Genes Devel, 9:686- 

699 (1995)) and are likely to carry out their functions through interaction with different 
determination factors. 

The use of p-gal complementation mutants also can be extended to an analysis of 
I-mf. I-mf has also been implicated as a negative regulator of myogenesis in the embryo 
20 (Chen et al, Cell, 86:731-741 (1996)). Interestingly however, I-mf and Mtwist are co- 

expressed throughout most of the somite. It is not clear if their presence in the same cells 
is merely an indication of the existence of redundant mechanisms for repressing the 
activity of the myogenic HLH regulators or whether the two factors have distinct 
functions. In the first case, the prediction would be that both I-mf and Mtwist associate 
25 with the same factors. In the second case, differences and interactions with different 

factors should be detectable using our experimental approach. 

Example 10: Analysis of protein interactions by Fluorescence- Activated 

Cell Sorting (FACS) 
The P-gal activity of a population of C2C12 cells that were coinfected with 
30 FRAP- Act and FKBP12-A<a (as described in Examples 1 and 2) was assayed in the 

presence and absence of 10 ng/ml rapamycin by FACS. FACS was carried out according 
to methods that are well-known in the art, e.g., Nolan et al (1988) Proc. Natl Acad Set 
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USA 85 :2603-2607. Using this assay, increased p-gal activity was detected in the majority 
of cells after 90 minutes of rapamycin treatment (Figure 6A). A range of expression levels 
was observed, as evidenced by the breadth of the peak of emission in the presence and 
absence of the drug (compare light and dark profiles in Figure 6A). This breadth is 
presumably due to variable efficiency of expression of each of the retroviral vectors 
following integration in the target cell. This inference is supported by the finding that 
when the 25% of cells expressing the lowest p-gal activity in the absence of rapamycin 
were collected (Figure 6B) and reassayed in the presence and absence of rapamycin, the 
treated and untreated cell populations yield non-overlapping peaks by FACS analysis, 
indicating a clear distinction between the treated (light peak) and untreated (dark peak) 
populations (Figure 6C). Thus, non-overlapping populations of cells distinguished by the 
expression (or non-expression) of complementing fusion proteins can be identified and 
isolated by FACS. 

Example 1 1 : Monitoring of EGF Receptor Dimerization in Living Cells 
A previously unrecognized mode of regulation of the epidermal growth factor 
(EGF) receptor signaling pathway that acts through receptor dimerization was revealed 
using the methods of the invention for monitoring protein-protein interactions at the 
membrane of live cells. Chimeric proteins containing the extracellular and transmembrane 
domains of the EGF receptor, fused to weakly complementing p-galactosidase (p-gal) 
deletion mutants, were expressed in myoblasts. Treatment of the cells with EGF resulted 
in chimeric receptor dimerization as assessed by a rapid increase in P-gal enzymatic 
activity. Further treatment with EGF did not restimulate dimerization unless an inhibitor 
of EGF receptor tyrosine kinase was added. These results reveal a feedback mechanism in 
which tyrosine kinase activity of the dimeric receptor inhibits further dimerization of the 
receptor. 

Methods 

Construction of chimeric receptors. The weakly complementing Aa and 

Aco deletion mutants of p-gal were each linked to a polypeptide sequence containing the 
extracellular and transmembrane domains of the human EGF receptor to form chimeric 
receptor molecules. The chimeric receptors lacked the cytoplasmic domain, and attendant 
tyrosine kinase activity, of the native receptor. The procedure was as follows. The 
sequence coding for the extracellular and transmembrane domains of the human EGF 
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receptor (amino acids 1-655) was amplified by polymerase chain reaction (PCR) using 
primers that incorporated an Ncol site at the 5' end and an Xhol site at the 3' end of the 
PCR product. Although this fragment retains threonine 654, which is a site of protein 
kinase C (PKC) phosphorylation, arginines 656 and 657 are removed, destroying the 
consensus PKC recognition sequence. The amino acid sequence beginning with threonine 
654 is thr-leu-glu-ser-met, with the met residue being the beginning of the P-gal sequence. 
The glu and ser codons are generated by the junction sequence and are not native to either 
EGF or p-gal. 

DNAs encoding the chimeric receptors were inserted into a retroviral vector also 
encoding a selectable marker. For the construct containing the EGF receptor- Aa fusion, 
the selectable marker was the neo gene, encoding G418 resistance; while the EGF 
receptor-Aco fusion specified hygromycin resistance (Figure IB). Accordingly, the EGF 
receptor PCR product was digested and cloned into the Ncol and Xhol sites of the 
pWZL-Aa and pWZL-Aco vectors. The pWZL-Aa-neo and pWZL-Aco-hygro plasmids 
were constructed by cloning the lacZ Aa and A© deletion mutants into pWZL-neo and 
pWZL-hygro, respectively. Mohler et al, supra; and Rossi et al (1997) Proc. Natl. Acad 
Sci. USA 94:8405-8410. Plasmids were transfected into <J>NX cells using Lipofectamine 
(Life Technologies), and virus-containing supernatant was harvested 48-72 hours later. 
C2F3 mouse myoblasts (Rastinejad et al (1993) Cell 72:903-917) maintained in DME 
with 20% fetal bovine serum (FBS) in 10% C0 2 , were infected by overnight incubation in 
the viral supernatant. Cells containing both constructs were selected in 1 mg/ml G418 plus 
1 mg/ml hygromycin, and were maintained in 400 jig/ml of each antibiotic. 

EGF treatment and FACS analysis Cells were treated with mouse salivary 

gland EGF (Sigma) at 100 ng/ml and in some experiments were treated with tyrphostin 
AG1478 (Calbiochem) at 100 nM. Following all treatments, cells were rinsed with 
phosphate buffered saline (PBS), trypsinized, and resuspended in PBS + 5% FBS. 
Fluorescein di-P-D-galactopyranoside (FDG; Molecular Probes) was loaded into the cells 
by hypotonic shock as described. Fiering et al (1991) Cytometry 12:291-301 and Nolan 
et al (1988) Proc. Natl Acad Set USA 85:2603-2607. Cells were kept on ice until 
analysis on the cell sorter, which was conducted 1 to 2 hours after trypsinization. 

The chimeric receptor was detected by immunofluorescence using a monoclonal 
mouse anti-human EGF receptor antibody diluted 1 :100 (clone EGFR1, Dako) and either 
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phycoeiythrin-labeled horse anti-mouse IgG (Vector) or fluorescein-labeled goat anti- 
mouse IgG (Cappel) diluted 1:100. Cells were trypsinized and stained in PBS + 5% FBS. 
For each sample, FACS analysis data was collected for 5000 cells. Cells were cloned on a 
Becton-Dickinson FACS Vantage and analyzed on a Becton-Dickinson FACScan at the 

5 Stanford Shared FACS Facility. Data analysis was facilitated by FlowJo software (Tree 

Star, Inc.). Data shown here as FACS profiles were adjusted for autofluorescence using 
autofluorescence compensation. Alberti et al (1987) Cytometry 8: 1 14-1 19. Mean 
fluorescence data were adjusted for autofluorescence and for endogenous mammalian P- 
gal activity by subtracting the mean fluorescence of untransduced cells loaded with FDG 

10 substrate. 

Results 

Receptor dimerization assay. The two chimeric DNAs were each cloned 

into retroviral vectors encoding selectable markers (Fig. 7B) and transduced into the C2F3 
mouse myoblast cell line. After selection with G41 8 and hygromycin, P-gal enzyme 

15 activity was monitored using the fluorescence activated cell sorter (FACS) to measure the 

cleavage product of a fluorogenic substrate. In the absence of EGF, the population of 
transduced cells consisted of a mixture of cells with low and high levels of p-gal activity 
(Fig. 7C, light gray curve), which was not unexpected given that the EGF receptor is 
capable ofdimerizing in the absence of EGF. Gadellaetfa/. (1995) 1 Cell Biol 129:1543- 

20 1 558. Following stimulation of the population of cells with EGF many of the cells 

exhibited increased P-gal activity (Fig. 7C, dark gray curve). FACS analysis with an 
antibody specific to the human EGF receptor showed that the cells expressed a broad range 
of levels of the chimeric receptor (Fig. 7D, medium gray curve). Clones from this 
population were isolated and screened for low background levels of P~ gal activity in the 

25 absence of EGF, and increased levels of P-gal activity in the presence of EGF. One such 

clone had a low level of chimeric receptor expression relative to the population (Fig. 7D, 
dark gray curve) and exhibited a several-fold increase in P-gal activity in the presence of 
EGF (Fig. 7E), indicating dimerization of the chimeric receptor. Dimerization was also 
observed following stimulation with other EGF-like growth factors that bind and activate 

30 the EGF receptor, such as TGF-ct, heparin-binding EGF-like growth factor, and 

betacellulin; but not with EGF-like factors, such as heregulin a, that act through related 
receptors other than the EGF receptor. Beerli et al (1996) J. Biol Chem. 271:6071-6076. 
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Dimerization, expressed as the mean fluorescence or |3-gal activity of the cells, could be 
detected with EGF treatments as short as one minute, and dimerization increased rapidly 
with longer exposure to EGF (Fig. 7F). 

Tifne-course of EGF Receptor dimerization. In order to follow the fate of 

receptor dimers over time, cells from the same clone described above were cultured in 
media containing EGF for 0 to 24 hours and then analyzed by FACS. Dimerization peaked 
after 2 to 4 hours in EGF, and then decreased (Fig. 8). The fold increase in dimerization 
and the rate of the ensuing decline in dimerization differed among experiments, but the 
overall pattern was consistent, and was also observed with the original population of 
uncloned cells. By contrast, measurement of the levels of the chimeric receptor on the cell 
surface by immunofluorescence using the FACS showed that the amount of chimeric 
receptor on the cell surface remained essentially constant over the period that dimerization 
markedly decreased (Fig. 8, dashed line). It was concluded that the decline in dimerization 
was due to either the depletion of EGF from the media, or to an inhibition of receptor 
dimerization. 

Feedback regulation of EGF Receptor dimerization During the decline in 

dimerization, the response to a second EGF treatment was minimal, suggesting that the 
cells were resistant to further EGF-mediated dimerization despite the continued presence 
of the chimeric receptor on the cell surface. By contrast, if, following EGF treatment, cells 
were incubated in media lacking EGF for several hours, dimerization could be restimulated 
with a second treatment of EGF. This indicated that the continued presence of EGF in the 
media was the basis for the continued inhibition of dimerization of the receptor. A 
possible explanation for these results is that signaling through the endogenous wild-type 
EGF receptors in the cells inhibits dimerization of the chimeric receptor. A test of this 
hypothesis was possible, using AG1478, a highly specific inhibitor of the EGF receptor 
tyrosine kinase. Levitzki et al. ( 1 995) Science 267: 1 782- 1 788. 

Accordingly, cells expressing chimeric receptor were treated with EGF overnight, 
and then retreated with EGF or tyrphostin. As shown in Figure 9A (left panel), sample I 
received a single overnight treatment with 100 ng/ml EGF. Samples II and III also were 
treated with EGF overnight, and then retreated with 100 ng/ml EGF for 2 hours (sample 
II), or 100 nM tyrphostin AG1478 for 2 hours (sample III). Sample IV received a single 2 
hour treatment with 100 ng/ml EGF, and sample V received no treatment. The results 
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(Figure 9 A, right panel) show that treatment of the cells with tyrphostin led to an increase 
in dimerization, yielding diraerization levels that were comparable to the peak levels 
observed after a single two hour treatment with EGF, indicating that EGF receptor tyrosine 
kinase activity is involved in inhibiting receptor dimerization. Tyrphostin treatment also 
caused an increase in the amount of p-gal activity observed when previously unstimulated 
cells were treated with EGF. Cells were treated with EGF and tyrphostin, or EGF alone, 
over periods ranging from 0-24 hours. Cells that received both tyrphostin and EGF 
showed greater p-gal activity than cells that received EGF alone, for treatment times of up 
to 6 hours (Fig. 9B). By 8 hours of treatment, there was no difference in EGF receptor 
dimerization between EGF-treated cells and EGF+tyrphostin-treated cells. Repeated 
administration of tyrphostin every four hours did not further prolong the increased p-gal 
activity. 

These results show that inhibition of receptor tyrosine kinase can relieve a feedback 
inhibition of receptor dimerization. Protein kinase C phosphorylation can decrease 
receptor binding affinity for EGF by phosphorylating the receptor on sites in the 
cytoplasmic domain. However, since the chimeric receptor used in the experiments 
described herein lacks the known sites of PKC phosphorylation, the inhibition of 
dimerization observed with this receptor must be mediated through the extracellular or 
transmembrane regions of the receptor. 

These results also demonstrate that, using the methods and compositions of the 
invention, it is possible to monitor EGF receptor dimerization in live cells. They show, in 
addition, that receptor kinase activity is involved in regulating dimerization, the first step 
after ligand binding in EGF signal transduction. Dimerization is measurable following 
treatment of cells with EGF after as little as one minute, which indicates that the p-gal 
complementation is able to monitor the rapid production of newly formed protein dimers. 
Previous data on EGF binding, receptor internalization, and substrate phosphorylation also 
indicate that the receptor responds to ligand within minutes. Felder et al (1992) J. Cell 
Biol 117:203-212; and Kiyokawa et al. (1997)7. Biol Chem. 272:18656-18665. 
Although receptor dimerization declines after a few hours, the chimeric receptor remains 
on the cell surface and is refractory to further dimerization in response to EGF. Inhibition 
of the endogenous receptor tyrosine kinase, however, permits further dimerization. 
Inhibition of receptor dimerization begins immediately following receptor activation, as 
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shown by the observation that including tyrphostin with the initial EGF treatment increases 
dimerization over the levels observed with EGF alone. 

The kinetics of complementation reflect the kinetics of association of the binding 
partners The decline in EGF receptor dimerization is in contrast to 

5 observations using P-gal complementation to monitor the interaction of FRAP and 

FKBP12. See Examples 1, 2 and 10, supra; see also Rossi et al (1997), supra. Using 
P-gal complementation to detect the rapamycin-mediated interaction between FRAP and 
FKBP12, the slowest increase in p-gal activity was seen at the earliest time points 
following the addition of rapamycin, but p-gal activity continued to increase for at least 20 

10 hours. This could be due to stabilization of the chimeric protein interactions by formation 

of the active P-gal complex. With EGF receptor dimerization, however, the most rapid 
increase in P-gal activity was seen at the earliest time points after the addition of EGF to 
the media; whereas, after 2 to 4 hours, the P-gal activity declined. The difference between 
these results indicates that the dimerization kinetics observed with P-gal complementation 

15 are not simply a reflection of P-gal complementation kinetics or stabilization, but reflect, at 

least to some degree, the kinetics of interaction of the non-P-gal portions of the chimeric 
proteins. The results also show that P-gal complementation can monitor the regulation of 
dimerization by other proteins. 

Comparison to previous methods Receptor dimerization has typically 

20 been studied by in vitro methods such as chemical cross-linking and immunopurification, 

followed by gel electrophoresis. Yarden et al (1987) Biochemistry 26:1443-1451. 
Recently, EGF receptor dimerization has also been analyzed by fluorescence resonance 
energy transfer (FRET). Gadella et al (1995) supra. Fluorescein and rhodamine labeled 
EGF was added to cells, and dimerization of the receptor was measured microscopically. 

25 Low temperature incubations and fixation of the cells was required to prevent 

internalization of the receptor before analysis, a problem that was avoided in the present 
experiments by using a non-internalizing mutant receptor. FRET can also be used to study 
interactions of fluorescently-labeled molecules within the cell or cell membrane; however, 
labeling and introduction of these molecules at sufficiently high concentration can be 

30 cumbersome. It has recently been shown that green fluorescent protein can be modified 

and used for FRET analysis on genetically expressed proteins. Miyawaki et al (1997) 
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Nature 388:882-887. The GFP signal, however, cannot be enzymatically amplified as is 
the case with (3-gal. 

Thus, P-gal complementation provides a rapid method for monitoring receptor 
dimerization in live cells. This method can be used for high throughput screening for 
pharmacological agents that can bind to a number of receptors and act as either agonists or 
antagonists. Binding data alone cannot indicate whether or not an agent can elicit a 
response; identifying a response, by analysis of downstream effects such as 
phosphorylation, involves destruction of the cells followed by in vitro analysis. P-gal 
complementation will also enable a screen for novel dimerization partners in a mammalian 
"two-hybrid" assay that, in the case of membrane receptors, can offer new insight into the 
regulation of signal transduction pathways. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be apparent to 
those skilled in the art that certain changes and modifications may be practiced. Therefore 
the foregoing descriptions and examples should not be construed as limiting the scope of 
the invention. 
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CLAIMS 

What is claimed is: 

1 . A reporter system component comprising: 

a first low-affinity reporter subunit, coupled to a first putative binding moiety; 

wherein the first low-affinity reporter subunit is capable of association with at least 
a second low-affinity reporter subunit to generate a detectable signal, said association 
being mediated by the first putative binding moiety. 

2. The reporter system component of claim 1 wherein the first putative binding 
moiety is a protein. 

3. The reporter system component of claim 2 wherein the protein is selected 
from the group consisting of members of a signal transduction cascade, cell surface 
receptors, proteins regulating apoptosis, proteins that regulate progression of the cell-cycle, 
proteins involved in the development of tumors, transcriptional-regulatory proteins, 
translational regulatory proteins, proteins that affect cell interactions, cell adhesion 
molecules, proteins which are members of ligand-receptor pairs, proteins that participate in 
the folding of other proteins, and proteins involved in targeting to intracellular 
compartments. 

4. A reporter system comprising the reporter system component of claim 1 and 
further comprising at least a second low-affinity reporter subunit coupled to a second 
putative binding partner of the first putative binding moiety. 

5. The reporter system of claim 4 wherein the binding affinity of the putative 
binding moieties for each other is greater than the binding affinity of the first and second 
reporter subunits for each other. 

6. The reporter system of claim 5 wherein the production of the signal is 
dependent upon the binding of the putative binding moieties. 



WO 98/44350 PCT/US98/06648 

52 

7. The reporter system of claim 4 wherein the first and second putative 
binding moieties are proteins. 

8. The reporter system of claim 7 wherein the protein is selected from the 
5 group consisting of members of a signal transduction cascade, cell surface receptors, 

proteins regulating apoptosis, proteins that regulate progression of the cell-cycle, proteins 
involved in the development of tumors, transcriptional-regulatory proteins, translation^ 
regulatory proteins, proteins that affect cell interactions, cell adhesion molecules, proteins 
which are members of ligand-receptor pairs, proteins that participate in the folding of other " 
10 proteins, and proteins involved in targeting to intracellular compartments. 

9. The reporter system of claim 7, wherein the first and second reporter 
subunits each comprise a protein, and wherein said proteins are capable of associating to 
catalyze a reaction to produce a detectable signal. 

15 

10. The reporter system of claim 9 wherein the associated reporter subunits 
catalyze a reaction to produce a product which is directly detectable as the detectable 
signal. 

20 11. The reporter system of claim 7 wherein the first and second subunits are 

low affinity binding mutant subunits of a hydrolytic enzyme. 

12. The reporter system of claim 1 1 wherein the first and second subunits are 
low affinity binding mutant subunits of P-galactosidase. 

25 

13. The reporter system component of claim 1 wherein said component 
comprises a fusion protein including said low affinity reporter subunit and said first 
putative binding moiety. 



30 



1 4. The reporter system component of claim 1 3 wherein the low affinity 
reporter subunit comprises a low affinity binding mutant subunit of p-galactosidase. 
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15. A nucleic acid encoding the fusion protein of claim 1 3 . 



16. The nucleic acid of claim 1 5 further comprising regulatory sequences 
effecting expression of the putative binding protein. 

17. A viral vector comprising the nucleic acid of claim 15. 

1 8. A cell transformed with the nucleic acid of claim 1 5. 

19. A cell transformed with a nucleic acid encoding the reporter system 
component of claim 13 and a nucleic acid encoding at least a second of said reporter 
system components. 

20. The reporter system component of claim 1 4 wherein the fusion protein 
further comprises an additional protein sequence between said reporter subunit and said 
putative binding moiety. 

21. A method of determining the occurrence of binding between first and 
second putative binding moieties, the method comprising: 

a) providing a reporter system comprising: 

a first component comprising a first low affinity reporter subunit, 
coupled to the first putative binding moiety, and 

a second component comprising a second low affinity reporter 
subunit coupled to the second putative binding moiety; 

wherein the first low affinity reporter subunit is capable of association with 
at least the second low affinity reporter subunit to generate a detectable signal, said 
association being mediated by the binding of the first and second putative binding 
moieties; 

b) combining the first component and the second component; and 

c) detecting the presence or absence of the signal. 
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22. The method of claim 21 wherein the binding affinity of the putative binding 
moieties for each other is greater than the binding affinity of the first and second reporter 
subunits for each other. 

5 23. The method of claim 21 wherein the first and second putative binding 

moieties are proteins. 

24. The method of claim 23 wherein the protein is selected from the group 
consisting of members of a signal transduction cascade, cell surface receptors, proteins 

10 regulating apoptosis, proteins that regulate progression of the cell-cycle, proteins involved 

in the development of tumors, transcriptional-regulatory proteins, translational regulatory 
proteins, proteins that affect cell interactions, cell adhesion molecules, proteins which are 
members of ligand-receptor pairs, proteins that participate in the folding of other proteins, 
and proteins involved in targeting to intracellular compartments. 

15 

25. The method of claim 21, wherein the first and second reporter subunits each 
comprise a protein, and wherein said proteins are capable of associating to catalyze a 
reaction to produce a detectable signal. 

20 26. The method of claim 25 wherein the associated reporter subunits catalyze a 

reaction to produce a product which is directly detectable as the detectable signal. 

27. The method of claim 26 wherein the first and second subunits are low 
affinity binding mutant subunits of P-galactosidase. 

25 

28. The method of claim 21 wherein each of said first and second components 
comprises a fusion protein. 

29. The method of claim 28 wherein the low affinity reporter subunit comprises 
30 a low affinity binding mutant subunit of P-galactosidase. 
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30. The method of claim 28 wherein step (a) comprises transforming a cell with 
one or more nucleic acids encoding the fusion proteins. 

31. The method of claim 30 wherein step (c) comprises detecting the signal 
within the cell. 

32. The method of claim 30 wherein the one or more nucleic acids encoding the 
fusion proteins further comprise sequences regulating expression of the putative binding 
protein. 

33. The method of claim 30 wherein the fusion proteins are encoded by a viral 

vector. 

34. The method of claim 28 wherein the fusion protein further comprises a 
protein sequence between said reporter subunit and said putative binding moiety. 

35. The method of claim 21 wherein the degree of binding is quantitated. 

36. The method of claim 21 wherein the method further comprises detecting the 
effect of a third moiety on the binding of the first and second binding moieties, the method 
further comprising, after step (a) and prior to step (b), combining said reporter system with 
said third moiety. 

37. The method of claim 21 wherein the method further comprises determining 
potential agonist or antagonist activity of said third moiety. 

38. The method of claim 3 1 wherein the intracellular localization of the signal 
is determined. 

39. The method of claim 21 wherein step (b) comprises combining the first and 
second components in the presence of a substance to determine the effect of the substance 
on binding of the first and second binding moieties. 
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40. The method of claim 39 wherein the substance is a putative binding 
inhibitor of binding moieties having a predetermined binding affinity, and wherein the 
absence of the signal in step (c) provides an indicator that the substance is a binding 
inhibitor. 

41 . The method of claim 39 wherein the substance is a putative promoter of 
binding between binding moieties having low or substantially no binding affinity for each 
other, and wherein the presence of the signal in step (c) provides an indicator that the 
substance is a promoter of binding of the binding moieties. 

42. The method of claim 39 wherein the first and second reporter units, and first 
and second binding moieties, each are proteins; 

wherein the components provided in step (a) each comprise a fusion protein 
including the reporter subunit and the binding moiety; 

wherein step (b) comprises expressing nucleic acid sequences encoding the first 
and second components within a cell suspected to contain the substance which inhibits or 
promotes binding of the binding moieties; and 

wherein step (c) comprises detecting the presence or absence of the signal in the 
cell or iysate thereof, thereby to determine the presence or absence in the cell of the 
substance which acts as an inhibitor or promoter of binding between the binding moieties. 

43. The method of claim 39 wherein the substance is selected from the group 
consisting of a protein, lipid, carbohydrate, nucleic acid and a small molecule 
pharmaceutical. 

44. A method of screening for binding of a first binding moiety with members 
of a plurality of different second putative binding moieties, the method comprising: 

a) providing a plurality of reporter systems each comprising: 

a first component comprising a first low affinity reporter subunit coupled to 
the first binding moiety, and 
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one of a plurality of second components each comprising a second low 
affinity reporter subunit coupled to one of said plurality of second putative binding 
moieties, wherein in each of said second components, said second putative binding 
moiety is different; 

wherein the first low affinity reporter subunit is capable of association with 
the second low affinity reporter subunit to generate a detectable signal upon the 
binding of the first binding moiety with one of said different second putative 
binding moieties; 

b) individually combining the first component with each of the plurality of 
second components to produce a plurality of binding assay samples, each of which 
includes the first component and a different one of the second components; and 

c) detecting the presence or absence of the signal in each of the binding assay 
samples. 

45. The method of claim 44 wherein the first and second components each 
comprise a fusion protein including the binding moiety and the reporter subunit. 

46. The method of claim 45 wherein, in step (b), the components are expressed 
from a nucleic acid sequence introduced into a cell. 

47. The method of claim 46, wherein the plurality of second putative binding 
moieties are encoded by members of a cDNA library. 

48. The method of claim 47, wherein the cell is a eukaryotic cell. 

49. . The method of claim 48, wherein the cell is a mammalian cell. 

50. The method of claim 49, wherein the cell is a human cell. 



5 1 . The method of claim 44, wherein, in step (c), the signal is quantitated. 
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52. The method of claim 44, wherein cells in which binding between the first 
binding moiety and one of the plurality of putative second binding moieties has occurred 
are separated from cells in which said binding has not occurred. 

53. The method of claim 52, wherein separation is by fluorescence-activated 
cell sorting. 

54. The method of claim 44, wherein the first binding moiety is selected from 
the group consisting of cell surface receptors, transcriptional regulatory proteins, 
translational regulatory proteins, replication proteins, splicing proteins, signal transduction 
proteins, cell-cell adhesion molecules, cell-substrate adhesion molecules, cell-cycle 
proteins, oncogene products, tumor suppressor proteins, membrane receptors, proteins 
regulating apoptosis, developmental regulatory proteins, proteins that affect cell 
interactions, proteins that participate in the folding of other proteins, proteins involved in 
targeting to intracellular compartments, viral proteins and cytoskeletal proteins. 

55. The method of claim 39 wherein the substance is a peptide, drug or 
synthetic analog. 

56. The reporter system of claim 4 wherein the first putative binding moiety 
and the second putative binding partner comprise the same molecule. 

57. A method of determining the occurrence of association between first and 
second moieties, the method comprising: 

a) providing a reporter system comprising: 

a first component comprising a first low affinity reporter subunit, 
coupled to the first moiety, and 

a second component comprising a second low affinity reporter 
subunit coupled to the second moiety; 

wherein the first low affinity reporter subunit is capable of association with 
at least the second low affinity reporter subunit to generate a detectable signal, said 
association being mediated by association between the first and second moieties, 
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